Platinum-CoT: High-Value Technical Reasoning. Distilled via Phi-4 → DeepSeek-R1 (70B) → Qwen 2.5 (32B) Pipeline

I’ve just released a preview of Platinum-CoT, a dataset engineered specifically for high-stakes technical reasoning and CoT distillation.

What makes it different? Unlike generic instruction sets, each sample passes through a three-model “Platinum” pipeline:

Architect: Phi-4 generates complex, multi-constraint problems at Staff Engineer level.
Solver: DeepSeek-R1 (70B) provides the “Gold Standard” Chain-of-Thought reasoning (averaging ~5.4k characters per reasoning path).
Auditor: Qwen 2.5 (32B) performs a strict logic audit; only samples scoring 8/10 or higher are kept.
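
To make the three stages concrete, here is a minimal sketch of how such a generate, solve, and audit loop could be wired together. It is not the actual pipeline code: `Sample`, `run_pipeline`, and the three callables are hypothetical names, and the model calls are replaced with stubs. The only detail taken from the description above is the 8/10 cutoff.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List

MIN_AUDIT_SCORE = 8  # cutoff from the post: keep samples scoring 8/10 or higher


@dataclass
class Sample:
    problem: str
    reasoning: str
    audit_score: int


def run_pipeline(
    seeds: Iterable[str],
    architect: Callable[[str], str],     # hypothetical wrapper around the problem generator
    solver: Callable[[str], str],        # hypothetical wrapper around the CoT solver
    auditor: Callable[[str, str], int],  # hypothetical wrapper returning a 0-10 score
    min_score: int = MIN_AUDIT_SCORE,
) -> List[Sample]:
    """Generate a problem per seed, solve it, and keep only samples that pass the audit."""
    kept: List[Sample] = []
    for seed in seeds:
        problem = architect(seed)
        reasoning = solver(problem)
        score = auditor(problem, reasoning)
        if score >= min_score:
            kept.append(Sample(problem, reasoning, score))
    return kept


def demo() -> List[Sample]:
    # Stubbed scores stand in for real auditor output; the values are made up.
    fake_scores = {"io_uring": 9, "FIX protocol": 6}
    return run_pipeline(
        seeds=fake_scores,
        architect=lambda topic: f"Design problem about {topic}",
        solver=lambda problem: f"Step-by-step reasoning for: {problem}",
        auditor=lambda problem, _reasoning: next(
            score for topic, score in fake_scores.items() if topic in problem
        ),
    )


if __name__ == "__main__":
    for sample in demo():
        print(sample.audit_score, sample.problem)
```

Passing the stages in as plain callables keeps the filter logic independent of whichever inference backend serves each model, so the audit threshold can be tested without loading any weights.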

Featured Domains:

– Systems: Zero-copy (io_uring), Rust unsafe auditing, SIMD-optimized matching.

– Cloud Native: Cilium networking, eBPF security, Istio sidecar optimization.

– FinTech: FIX protocol, low-latency ring buffers.

Check out the parquet preview on HuggingFace:

Platinum-CoT: High-Value Technical Reasoning. Distilled Via Phi-4 → DeepSeek-R1 (70B) → Qwen 2.5 (32B) Pipeline