GPU Memory Math for LLMs: Formula That Tells You What Fits on Your GPU
XMasterrrr
12 points
2 comments
May 20, 2026
Related Discussions
Found 5 related stories in 77.9ms across 8,303 title embeddings via pgvector HNSW
- Right-sizes LLM models to your system's RAM, CPU, and GPU bilsbie · 76 pts · March 01, 2026 · 64% similar
- Every GPU That Mattered jonbaer · 309 pts · April 07, 2026 · 51% similar
- Advanced Quantization Algorithm for LLMs lastdong · 121 pts · May 01, 2026 · 50% similar
- Show HN: How I topped the HuggingFace open LLM leaderboard on two gaming GPUs dnhkng · 358 pts · March 10, 2026 · 50% similar
- Show HN: Find the best local LLM for your hardware, ranked by benchmarks andyyyy64 · 279 pts · May 15, 2026 · 50% similar
Discussion Highlights (2 comments)
metadat
This is super useful. Most of the time I go to run a model off Hugging Face on my 64GB MBP I run into issues where I drastically overestimated what it could do. :>
DiabloD3
This isn't very useful. V of context is not equal across models. Also, huggingface tells you how big the model is for the exact one you have in your hand, why the weird guesswork? Dynamic quants are not going to magically fit some formula.