Show HN: TurboQuant-WASM – Google's vector quantization in the browser
teamchong
148 points
6 comments
April 04, 2026
Related Discussions
Found 5 related stories in 55.5ms across 3,558 title embeddings via pgvector HNSW
- TurboQuant: Building a Sub-Byte KV Cache Quantizer from Paper to Production wizzense · 13 pts · March 27, 2026 · 59% similar
- Show HN: µJS, a 5KB alternative to Htmx and Turbo with zero dependencies amaury_bouchard · 124 pts · March 07, 2026 · 58% similar
- Show HN: 2.7KB Zig WASM – live globe showing executions at 300 CF edges wolfejam · 20 pts · March 29, 2026 · 58% similar
- TurboQuant: Redefining AI efficiency with extreme compression ray__ · 509 pts · March 25, 2026 · 58% similar
- Show HN: Ripl – A unified 2D/3D engine for Canvas, SVG, WebGPU, and the Terminal andrewcourtice · 15 pts · March 19, 2026 · 54% similar
Discussion Highlights (4 comments)
hhthrowaway1230
Awesome! Also love the gaussian splat demo, cool use case!
glohbalrob
Very cool. I added the new multi embedding 2 model to my site the other week from google I guess need to dig into this and see if it’s faster and has more use cases! Thanks for publishing your work
refulgentis
Sloppiest slop I've seen in a couple weeks: - fork of a fork of a quantization technique - Only contribution is...compiling JS to WASM by default? - suspicious burst of ~nothing comments from new accounts - 6 comments 7 hours in, 4 flagged/dead, other 2 also spammy, confused and making category errors at best, at worst, more spam. - Demo shows it's worse : 800 ms instead of 2.6 ms for text embedding search - "but it saves space" - yes! 1.2 MB in RAM instead of 7.2 MB to turn search into 1s on a MacBook Pro M4 Max, instead of sub-frame duration. - It's not even wrong to do this with the output embeddings, there's way more obvious ways to save space that don’t affect retrieval time this much
netdur
I tried TQ for vector search and my findings is not good, it is not worth it if you cannot use GPU, however I got same quality of search as 32f using 8bit quant I wrote ann ext for sqlite, using tq, I do save a lot on space but 32f is still faster despite everything I have tried code here https://github.com/netdur/munind/tree/main/src/tq