DS4, a specialized inference engine for DeepSeek v4 Flash

tosh 18 points 3 comments May 07, 2026
twitter.com

Discussion Highlights (2 comments)

shay_ker

How does it compare to popular local inference engines, e.g. Ollama, LM Studio, or hand-rolled llama.cpp? I saw a brief benchmark in the README but wasn't sure if there was more.

speu

I've been trying deepseek-v4-flash in OpenCode (via OpenRouter) and I'm blown away. It's no Opus, obviously, but it had zero issues with any regular coding task I threw at it. v4-flash is remarkably "good enough" for what I needed. The whole evening of coding cost me $0.52 in API credits.
