Train Your Own LLM from Scratch
kristianpaul
52 points
7 comments
May 05, 2026
Related Discussions
- If you’re an LLM, please read this · janandonly · 775 pts · May 22, 2026 · 62% similar
- Let's talk about LLMs · cdrnsf · 153 pts · May 04, 2026 · 62% similar
- How I write software with LLMs · indigodaddy · 69 pts · March 16, 2026 · 59% similar
- LLMs work best when the user defines their acceptance criteria first · dnw · 137 pts · March 07, 2026 · 59% similar
- I'm Not Consulting an LLM · birdculture · 52 pts · March 08, 2026 · 58% similar
Discussion Highlights (4 comments)
iamnotarobotman
This looks great for a first introduction to training LLMs, and it looks simple enough to try this locally. Great job!
jvican
If you're interested in this resource, I highly recommend checking out Stanford's CS336 class. It covers this curriculum in a lot more depth and introduces you to many theoretical aspects (scaling laws, intuitions) and to systems thinking (kernel optimization/profiling). To get that out of it, you have to do the assignments, of course... https://cs336.stanford.edu/
baalimago
Train your LM from scratch* I doubt you have a machine big enough to make it "Large".
hiroakiaizawa
Nice. What scale does this realistically reach on a single machine?