Nvidia is proposing a beast of a CPU system for Windows PCs

tosh 261 points 453 comments June 06, 2026
twitter.com

Discussion Highlights (20 comments)

cyberziko

Good to know. I hope the price will be affordable; having a PC is becoming a luxury :)

YasuoTanaka

128GB of unified memory is a dream come true for local LLMs. VRAM has been the ultimate bottleneck for developers.
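As a rough way to see why 128GB matters here, the sketch below estimates the size of a model's weights alone from its parameter count and quantization width. The function names, the 70B example, and the 16 GB reserve for the OS, KV cache, and activations are illustrative assumptions, not figures from the thread.

```python
def weight_footprint_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the weights alone, in GB (10^9 bytes).

    params_billion * 1e9 weights * (bits / 8) bytes each, divided by 1e9.
    """
    return params_billion * bits_per_weight / 8


def fits(params_billion: float, bits_per_weight: float,
         memory_gb: float = 128.0, reserve_gb: float = 16.0) -> bool:
    """True if the weights fit after setting aside reserve_gb.

    reserve_gb is a hypothetical allowance for the OS, KV cache, and
    activations; real overhead depends on context length and runtime.
    """
    return weight_footprint_gb(params_billion, bits_per_weight) <= memory_gb - reserve_gb


if __name__ == "__main__":
    for bits in (16, 8, 4):
        size = weight_footprint_gb(70, bits)
        print(f"70B at {bits}-bit: {size:.0f} GB, fits in 128 GB: {fits(70, bits)}")
```

Under these assumptions a hypothetical 70B-parameter model does not fit at 16-bit precision but does at 8-bit or 4-bit.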

jqpabc123

I am not sure how many people will run AI models locally. It still seems like a niche application to me. I'd say this relates directly to the cost of running AI models remotely. And we won't know what the actual cost will be until AI vendors recover the huge pile of cash they've dumped into development (plus interest).

tosh

NB: the poster is Daniel Lemire ( https://lemire.me ), who is very skilled at getting performance out of compute hardware (e.g. via SIMD, cache usage, etc.)

2OEH8eoCRo0

Are their enterprise orders slowing down? Why use precious maxed out fab capacity on consumer stuff when it could be an enterprise chip?

llm_nerd

Does this person know that this is the same GB chip in the DGX Spark? It isn't some proposed thing; it's a chip loads of people have on their desk right now, and there are endless benchmarks of it. Decent single core (a long way from Apple level, but decent), but it makes up for it in core count to provide M5-level performance, CPU-wise. On memory bandwidth it is kind of starved, at 1/6th that of many GPUs. They got Microsoft to customize Windows for the RTX Spark, and will likely have to brutally throttle it when running as a laptop (it's literally a 140W TDP chip), and that's neat. It's going to be a very expensive laptop.

seanalltogether

Is it really unified memory? AMD Strix Halo is "unified" but you still have to allocate memory separately for cpu vs gpu. Apple Silicon is true unified memory.

sisve

> I am not sure how many people will run AI models locally. It still seems like a niche application to me.

Bill Gates had a quote some years ago... People still have not learned how fast we improve our tech and how much cheaper things get, I guess :)

infecto

"I am not sure how many people will run AI models locally. It still seems like a niche application to me. However, it will make decent machines to play video games."

I don't know who the winner will be, but with some of the recent Gemma releases it seems more probable that you may run some models locally, if only from a cost perspective, not even considering business security. I'm not sure how this type of architecture would make for good gaming, though, which puts the whole statement into question.

"Ranked in the top 2% of scientists globally (Stanford/Elsevier 2025) and among GitHub's top 1000 developers"

Side note, but this guy puts this everywhere; it probably gives me the inverse of the impression he is marketing for.

alberth

Is this essentially an Apple M-Series chip in concept?

BoredPositron

MediaTek and Nvidia, the horsemen of abandoning hardware after a year. The Jetson family still leaves a bad taste in my mouth.

SwtCyber

The interesting part to me isn't really the Cortex-X925 vs AVX-512 comparison, but Nvidia trying to make the GPU the center of a Windows PC rather than an add-in card.

cryo32

Yeah, when laptops are shipping with 8GB and Microsoft is suddenly interested in native apps, nope. Tech companies have strangled their own market.

AmazingTurtle

While unified memory may offer better performance than unsoldered DDR system memory, it still won't be as great as the 1.8TB/s of bandwidth on high-end consumer GPUs right now. Nvidia's master plan may be to make "only" 400GB/s of bandwidth the new normal, thus gatekeeping local model usage further behind "more memory, but not as fast as the cloud can do it".
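To put the two bandwidth figures above side by side, here is a minimal back-of-the-envelope sketch. It assumes token generation for a dense model is memory-bound, so each generated token requires reading roughly all the weights once, giving an upper bound of bandwidth divided by model size. The 20 GB model size and the function name are illustrative assumptions; real throughput will be lower and depends on the runtime, batch size, and caching.

```python
def decode_upper_bound_tok_s(bandwidth_gb_s: float, model_size_gb: float) -> float:
    """Rough ceiling on tokens/second for memory-bound decoding.

    Assumes every generated token streams all model weights from memory
    once, and ignores compute, KV-cache traffic, and any overlap.
    """
    return bandwidth_gb_s / model_size_gb


if __name__ == "__main__":
    model_size_gb = 20.0  # hypothetical quantized model, not from the thread
    for label, bw in (("400 GB/s unified memory", 400.0),
                      ("1.8 TB/s high-end GPU", 1800.0)):
        rate = decode_upper_bound_tok_s(bw, model_size_gb)
        print(f"{label}: at most ~{rate:.0f} tokens/s")
```

Under this simplification the ratio between the two ceilings is just the bandwidth ratio, 4.5x, regardless of model size. The faster memory only helps if the model fits in it, which is the trade-off the comment describes.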

dofm

Here is the press release for the actual machine: https://nvidianews.nvidia.com/news/nvidia-microsoft-windows-... I have been somewhat surprised at the lack of commentators observing that this is Microsoft and above all NVIDIA launching a device that is fundamentally at odds with the metered cloud model of AI. When you look at the other announcements and murmurings (better offline BYOK for Copilot, talk of an unmetered AI future) I think it’s clear that these two firms understand that cloud-only AI is not sustainable or inherently in their interests. But their willingness to undermine OpenAI with a product like this is notable.

Waterluvian

It’s an opportunity for them to start doing away with the whole ATX thing where owners had freedom to mix and match at their own pleasure.

thrance

Will it support Linux?

ChrisArchitect

Related:

A powerful new chapter for Windows PCs, accelerated by Nvidia RTX Spark https://news.ycombinator.com/item?id=48352693

Nvidia RTX Spark https://news.ycombinator.com/item?id=48352939

PedroBatista

I don't want to be too harsh, and maybe I'm missing something, but the CPU is at least 2 years old and internally it has been a complete shitshow, and that's a minor hiccup compared to the firmware and software situation. It's an interesting "newcomer", and the more the better, but calling this a "beast" and a "game changer" is ridiculous, to say the least. Then there is the price...

dagmx

This feels like fluff to me on the part of the author (whose work I don’t want to trivialize), but I don’t think they’ve actually looked deeper than a paper spec sheet on this.

1. Yes, it has the same number of cores as a 5070 mobile. It’s also running at a shared peak of 2/3 the bandwidth and a shared peak of 2/3 the TDP. The GPU by itself will likely perform at half the dedicated unit’s performance.
2. Apple may not have SVE2, but they do have AMX (private) and SME. I don’t see why he thinks SVE2 will give him more performance than SME.
3. He mentions a single core type but doesn’t mention the total makeup.

We have already known for a year how the DGX Spark compares to Apple chips. For CPU it’s roughly equivalent to an M3 Pro, and for GPU compute (not rasterization) it’s between an M4 Pro and M4 Max, without considering bandwidth.

The real advantage of these is that they run CUDA. That’s it. Otherwise, when they launch they’ll be 2-3 generations behind where Apple is and 1 generation behind AMD.

The other superpower of the DGX Spark was the NIC for pairing them together. But that’s been removed here too.
