FairyFuse: Multiplication-Free LLM Inference on CPUs via Fused Ternary Kernels

PaulHoule 17 points 1 comment May 12, 2026
arxiv.org

Discussion Highlights (1 comment)

Reubend

Paper looks great. No GitHub link that I can find though. Maybe I'll take a crack at an implementation if I've got some extra free time.
