A complete Llama2 inference engine that fits in 1356 bytes of x86 assembly
monax
26 points
0 comments
May 05, 2026
Related Discussions
Found 5 related stories in 87.4ms across 8,303 title embeddings via pgvector HNSW
- Llm9p: LLM as a Plan 9 file system mleroy · 15 pts · March 08, 2026 · 55% similar
- Show HN: A (marginally) useful x86-64 ELF executable in 298 bytes meribold · 12 pts · April 07, 2026 · 52% similar
- LLM plays an 8-bit Commander X16 game using structured "smart senses" russellharper · 15 pts · April 08, 2026 · 52% similar
- Right-sizes LLM models to your system's RAM, CPU, and GPU bilsbie · 76 pts · March 01, 2026 · 51% similar
- Advanced Quantization Algorithm for LLMs lastdong · 121 pts · May 01, 2026 · 49% similar