FairyFuse: Multiplication-Free LLM Inference on CPUs via Fused Ternary Kernels

PaulHoule 17 points 1 comment May 12, 2026

arxiv.org

Discussion Highlights (1 comment)

Reubend

Paper looks great. No GitHub link that I can find though. Maybe I'll take a crack at an implementation if I've got some extra free time.
