1-Bit Bonsai Image 4B Image Generation for Local Devices
modinfo
340 points
121 comments
May 31, 2026
Related Discussions
- Show HN: 1-Bit Bonsai, the First Commercially Viable 1-Bit LLMs PrismML · 182 pts · March 31, 2026 · 67% similar
- Show HN: Bonsai 1.7B ternary model at 442T/s on M4 Max hhuytho · 13 pts · May 04, 2026 · 57% similar
- Bun.Image chakintosh · 33 pts · May 23, 2026 · 50% similar
- Perceptual Image Codec: What Matters in Practical Learned Image Compression ksec · 107 pts · May 24, 2026 · 49% similar
- BitNet: 100B Param 1-Bit model for local CPUs redm · 326 pts · March 11, 2026 · 48% similar
Discussion Highlights (20 comments)
sorenjan
They call it a diffusion model, but it's based on Flux.2 which is a rectified flow model.
MitPitt
Lately I've noticed posts with barely 10 points getting to HN frontpage. Was it always like this?
yieldcrv
Impressive. It combines a couple of techniques that I always wanted the frontier models to have. I'm having trouble loading the WebGL browser demo on my phone, but no biggie.
lumost
I actually can’t wait for the future where I upgrade hardware in order to upgrade my AI as an alternative to an expensive subscription. There are many problems I want to work on which require billions of tokens. These are completely inaccessible without corporate project sponsorship at the moment. An ASIC generation machine which can pump out a few tens of thousands of tokens per second at Opus 4.6 quality is more than sufficient.
a1o
Did anyone catch the minimum hardware requirements for this? Both RAM and storage?
SilentM68
Question: is it compatible with Ollama or ComfyUI, or are those providers unneeded? Is it compatible with low-end hardware? Also, where does "./setup.sh" drop the components on Linux? Thank you, Sol
wiradikusuma
Is there a benchmark of local image generation models? Local = can run on a 16 GB MacBook or 8 GB+ NVIDIA card.
janniks
I was expecting to see images of Bonsai trees when I clicked this
potatoman22
I wonder why they didn't use a Bonsai model as the text encoder
sudb
Very interested to see where this kind of work goes for on-device video generation!
iJohnDoe
Does anyone ever get their stuff to actually work? Like, actually load?
jeroenhd
Couldn't try it because the demo app is iOS only and the web version just crashes my browser. The small model is impressive but if you front load a 1.8GB text encoder model, the savings aren't quite as useful. I do wonder how these compare to existing image generation models. I've tried https://github.com/alichherawalla/off-grid-mobile-ai for a while but I find the image generation models rather lacking.
captainregex
What trade-off would one need to clear to justify the hardware and the work to get this running locally as part of a broader system? It’s a lot of work setting up and maintaining a production harness/system on a local device. I don’t personally generate images at a scale where using a lab’s app somehow burns all my tokens. I like the idea of local AI, but I don’t see widespread adoption of it happening in commercial or customer situations anytime soon, no matter how small or good enough the models get. Even with Uber's token-burn whiplash, I doubt their answer will be “run some of it local”. IT nightmare, I’d imagine.
smallerize
"To our knowledge, Bonsai Image 4B is the first image model in its parameter class to run directly on an iPhone." Isn't SDXL 3.5B? And the refiner model is even larger. Those can run on an iPhone 13 Pro.
woadwarrior01
The text encoder is still 4-bit quantized.
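A rough back-of-the-envelope sketch of the storage arithmetic raised in the comments above, assuming "4B" means about 4e9 parameters and counting only the raw weights (real quantized formats also store scales and metadata, so actual files will be somewhat larger). The 1.8 GB text encoder figure is taken from the thread; the helper name `weight_bytes` is made up for illustration.

```python
def weight_bytes(num_params: float, bits_per_weight: float) -> float:
    """Approximate storage for the raw weights only.

    Ignores quantization scales, file metadata, and runtime activations.
    """
    return num_params * bits_per_weight / 8


GB = 1e9  # decimal gigabytes

# Assumption: "4B" in the model name means roughly 4e9 parameters.
image_model_params = 4e9

# Compare the same parameter count at different weight precisions.
for bits in (16, 4, 1):
    size_gb = weight_bytes(image_model_params, bits) / GB
    print(f"{bits:>2}-bit weights: {size_gb:.2f} GB")

# Figure quoted in the thread for the text encoder download.
text_encoder_gb = 1.8

# Share of the combined download taken by the text encoder
# when the image model itself is stored at 1 bit per weight.
one_bit_gb = weight_bytes(image_model_params, 1) / GB
share = text_encoder_gb / (text_encoder_gb + one_bit_gb)
print(f"Text encoder share of total: {share:.0%}")
```

Under these assumptions the 1-bit image model weights come to about 0.5 GB, so a 1.8 GB text encoder would dominate the combined footprint, which is the concern raised in the thread.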
mft_
Genuine question: is this solving a real problem? IME, the bottleneck when using diffusion models isn't storage space or memory, it's generation time. Lots of models will run on 8-12 GB 1080-generation GPUs onwards, or on Macs with similar memory, which are probably the bottom end from a GPU power perspective anyway. I also note that these models are marginally slower than the small FLUX.2 model they're based on. Okay, maybe this allows running a local model on something that has a reasonably powerful GPU and limited memory, like an iPhone, but is that really a common requirement?
junto
Just a side note: this website is classified by Apple as an adult website. I have Limit Adult Websites switched on in Content & Privacy Restrictions. It led me to wonder what happens if a domain gets a new owner and they want to petition Apple to remove the block.
moralestapia
This is why I don't think the big AI companies and NVIDIA will dominate the market. AIs will just run locally, on whatever hardware you have. Perhaps that's why they worked on this yet-to-be-defined partnership with ARM.
jijji
Using the demo and typing in "A sign that says xxxx" where xxxx is any text, it gets it wrong almost 100% of the time.
danielEM
Is there a way to run it on Vulkan?