I’ve just released a preview of Platinum-CoT, a dataset engineered specifically for high-stakes technical reasoning and CoT distillation.
What makes it different? Unlike generic instruction sets, it is built with a three-model “Platinum” pipeline:
- Architect: Phi-4 generates complex, multi-constraint problems at Staff Engineer level.
- Solver: DeepSeek-R1 (70B) provides the “Gold Standard” Chain-of-Thought reasoning (avg. ~5.4k characters per reasoning path).
- Auditor: Qwen 2.5 (32B) performs a strict logic audit; only samples scoring 8/10 or higher are kept.
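
For anyone curious how the three stages fit together, here is a minimal sketch of the generate, solve, audit, filter flow. It is not the actual pipeline code: the `Sample` class, the `run_pipeline` function, and the three stub callables are hypothetical stand-ins for the real model calls, and the only detail taken from the post is the 8/10 cutoff.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List

# Threshold from the post: samples rated 8/10 or higher are kept.
MIN_AUDIT_SCORE = 8


@dataclass
class Sample:
    problem: str      # output of the Architect stage
    reasoning: str    # Chain-of-Thought from the Solver stage
    audit_score: int  # 0-10 rating from the Auditor stage


def run_pipeline(
    topics: Iterable[str],
    architect: Callable[[str], str],
    solver: Callable[[str], str],
    auditor: Callable[[str, str], int],
    min_score: int = MIN_AUDIT_SCORE,
) -> List[Sample]:
    """Run Architect -> Solver -> Auditor and keep only high-scoring samples."""
    kept: List[Sample] = []
    for topic in topics:
        problem = architect(topic)          # stage 1: write the problem
        reasoning = solver(problem)         # stage 2: produce the reasoning path
        score = auditor(problem, reasoning) # stage 3: rate the logic
        if score >= min_score:
            kept.append(Sample(problem, reasoning, score))
    return kept


# Hypothetical stubs so the sketch runs without any model backend.
def stub_architect(topic: str) -> str:
    return f"Design problem about {topic}"


def stub_solver(problem: str) -> str:
    return f"Step-by-step reasoning for: {problem}"


def stub_auditor(problem: str, reasoning: str) -> int:
    # Arbitrary scoring rule, for demonstration only.
    return 9 if "io_uring" in problem else 6


if __name__ == "__main__":
    kept = run_pipeline(
        ["io_uring", "FIX protocol"], stub_architect, stub_solver, stub_auditor
    )
    for sample in kept:
        print(sample.audit_score, sample.problem)
```

In a real setup each stub would be replaced by a call to the corresponding model, and the auditor would need a prompt that returns a parseable numeric score.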
Featured Domains:
- Systems: Zero-copy (io_uring), Rust unsafe auditing, SIMD-optimized matching.
- Cloud Native: Cilium networking, eBPF security, Istio sidecar optimization.
- FinTech: FIX protocol, low-latency ring buffers.
Check out the parquet preview on HuggingFace:
submitted by /u/BlackSnowDoto