Open Reproduction of DeepSeek-R1
yogthos
221 points
18 comments
June 11, 2026
Related Discussions
Found 5 related stories in 95.7ms across 10,324 title embeddings via pgvector HNSW
- DeepSeek V4: The Open-Source Model Frontier Labs Feared HelloAi · 61 pts · May 15, 2026 · 66% similar
- DeepSeek-V4 Technical Report [pdf] tianyicui · 19 pts · April 24, 2026 · 61% similar
- DeepSeek v4 impact_sy · 455 pts · April 24, 2026 · 60% similar
- Notes on DeepSeek vinhnx · 169 pts · June 10, 2026 · 59% similar
- DeepSeek-V4: a million-token context that agents can use ibobev · 12 pts · April 28, 2026 · 57% similar
Discussion Highlights (6 comments)
Tiberium
Last update over a year ago, so I hope (2025) gets added to the title: > [2025/05/26] (Step 1 completed!) We release Mixture-of-Thoughts--a curated reasoning dataset of 350k verified traces distilled from R1. The dataset spans tasks in mathematics, coding, and science, and is designed to teach language models to reason step-by-step. We also provide a recipe to train OpenR1-Distill-7B, which replicates the reasoning capabilities of deepseek-ai/DeepSeek-R1-Distill-Qwen-7B and marks the completion of step 1 in the Open R1 project. Doesn't look like they managed to actually reproduce R1, and only stopped on Step 1 out of their 3-step plan.
madiator
Check out OpenThoughts. It has a widely used dataset, a model that beats the deepseek's smaller reasoning models, and a paper that talks in detail about the data curation methodology. https://www.open-thoughts.ai/
christkv
What is the estimated cost these days to train something like this to conclusion?
yieldcrv
Too old now
aesthesia
If you really want to see fully open training pipelines for modern LLMs, Olmo and to a lesser extent Nemotron are what you should look at. https://github.com/allenai/OLMo https://github.com/NVIDIA-NeMo/Nemotron
poppafuze
"This will likely involve curating new, large-scale datasets for math, reasoning, and code.". ... everybody likes to hand-wave on this .