Making LLM Training Faster with Unsloth and NVIDIA
segmenta
114 points
21 comments
May 07, 2026
Related Discussions
Found 5 related stories in 94.4ms across 8,303 title embeddings via pgvector HNSW
- Executing programs inside transformers with exponentially faster inference u1hcw9nx · 17 pts · March 12, 2026 · 57% similar
- Unsloth Studio brainless · 233 pts · March 17, 2026 · 56% similar
- MegaTrain: Full Precision Training of 100B+ Parameter LLMs on a Single GPU chrsw · 280 pts · April 08, 2026 · 54% similar
- Right-sizes LLM models to your system's RAM, CPU, and GPU bilsbie · 76 pts · March 01, 2026 · 53% similar
- Train Your Own LLM from Scratch kristianpaul · 52 pts · May 05, 2026 · 52% similar
Discussion Highlights (3 comments)
stared
While I do admire Unsloth (especially their https://huggingface.co/unsloth/Qwen3.6-35B-A3B-GGUF binarizations), the linked blog post looks like written by AI from notes (unless a human author acquired this taste from interactions with chatbots).
electroglyph
nice writeup! looking forward to doing some more training as soon as i get some more data sorted. it'll be a custom arch, but i'll probably shoehorn it into unsloth for a speed boost.
wiradikusuma
Quick question, for average joe do we still need to "train" LLM or we can just use off the shelf model and use it ("inference"?) for normal use cases like business process augmentation (e.g. helping read paper receipts, or generate cat videos)?