Mini PC for local LLMs in 2026
charlieirish
31 points
32 comments
May 02, 2026
Related Discussions
Found 5 related stories in 95.4ms across 8,303 title embeddings via pgvector HNSW
- Show HN: Find the best local LLM for your hardware, ranked by benchmarks andyyyy64 · 279 pts · May 15, 2026 · 60% similar
- Running local LLMs offline on a ten-hour flight darccio · 118 pts · April 27, 2026 · 57% similar
- Old laptops in a colo as low cost servers argentum47 · 206 pts · April 09, 2026 · 55% similar
- Personal Computing (2022) xk3 · 16 pts · March 22, 2026 · 53% similar
- Llm9p: LLM as a Plan 9 file system mleroy · 15 pts · March 08, 2026 · 52% similar
Discussion Highlights (12 comments)
znpy
As somebody that has a vague interest in running local LLMs… they day i decide to burn cash on hardware I might as well go all-in a get either a 128gb mac studio or an nvidia dgx spark (or some other equivalent gb10-based system). The 64gb mac mini is also interesting, if anything because it is very likely to hold most of its value when reselling. I’m keeping an eye on the next apple hardware refreshes, particularly for mac minis and mac studios.
pjmlp
Currently NVidia's mini PC, or the version licensed to Asus, is one of the few that I can actually buy with Linux pre-installed with a fully OEM supported version. One would expect that by now buying desktop class computers on shops with a Linux experience would be rather common. Geekcom devices that it advertises as Linux ready, are actually sold with Windows pre-installed. I guess they mean WSL ready.
alexktz
Could we post articles that are obviously written by an LLM with a flair?
bachmeier
"Local inference is rarely cheaper if you’re being honest with yourself about how much you actually use it." Sorry, but this is not even close to "being honest", it's bad math. That calculation assumes you do nothing with the computer other than local inference.
bluechair
“What’s the memory bandwidth (GB/s) of the device holding the model weights?” Isn’t the recommended option going to be dog slow at 256 GB/s.
croes
> 128GB Ryzen AI MAX+ 395, listed at $2,099. Wasn‘t that a discounted price?
dannyw
> The 256 GB/s number is real, but for context, an Apple M5 Ultra hits ~800 GB/s on its unified memory The M5 Ultra has not been even announced. This article appears to be predominately or entirely LLM-produced with little to no human review, and contains numerous material and misinforming errors. It also omits serious contenders that's worth at least comparing, like the DGX Spark.
jcgrillo
I got a well used HP Z840 with 256GB ECC DDR4 and twin Xeons ca. 2014. Then I slapped 2 AMD V640 32GB passively cooled GPUs in it with some 3D printed fan shrouds and 2 1U 15k rpm fans each. They just fit! I needed to order a quad 8pin power cable, the standard configuration has 3 6pin cables--but there's unused pins on the GPU power rail, and there are aftermarket suppliers. 72 Xeon cores 256GB ECC DDR4 64GB VRAM $2200 total I run it on a 20A 240V outlet to make sure the power supply can deliver enough watts, but so far it's working pretty well. The eWaste LLM rig is probably not as good value for money as a new machine, but it gets the job done cheaper (for now). EDIT: IIRC this approach gets me more VRAM bandwidth than Strix Halo at the cost of less addressable GBs (but a lot more total system RAM), but I figured with CPU offloading that might make up for it? ALSO EDIT: Note you can get a 128GB Strix Halo motherboard minus power supply, fans, case, etc from Framework for $2200.. that could work if you have some parts lying around.
lkey
This article was authored by AI. It contains hallucinated info from compilations of random reddit threads.
mark_l_watson
I bought a 32G MacMini over two years ago and it has been great for experimenting with local models, and now is even useful for local coding (at a slow speed!) with models supporting large context sizes. With the current extreme RAM shortage I deeply regret not buying a 64G MacMini a few months ago. I bet a zillion people feel the same way.
visarga
Good research, but man do I feel the LLM vibe shining through. That sustained information density...
jmyeet
There's some mention of Apple silicon here but it's worth expanding upon. Macs have a unified memory architecture. So if you have a Mac with 64GB of memory then the GPU can use all of that. This is potentially quite useful but Apple silicon in general is limited by memory bandwidth. For comparison, a 5090 is 1792GB/s. Here are some examples: - GMKTek EVO-X2: 120GB/s reads, 212GB/s writes - NVidia DGX Spark 273GB/s - Mac Mini M4 120GB/s but only $600+ - Mac Mini w/ M4 Pro 273GB/s ($2199 for 64GB) - Mac Studio M4 Max 410GB/s ($3500 for 128GB) - Mac Studio M3 Ultra 819GB/s ($5500 for 96GB) - Macbook Pro 16" with M5 Pro 64GB 307GB/s ($3300) - Macbook Pro 16" with M5 Max 128GB 460GB/s ($5399) Sadly, Apple discontinued the 512GB Mac Studio. Mac Studios are a little long in the tooth now and due for an upgrade this year. I suspect that prices will be a lot higher given the RAM prices but we'll see.