Show HN: Local-first fast CPU image to text for screenshots, PDFs, webpages
mrkn1
14 points
16 comments
June 05, 2026
Related Discussions
Found 5 related stories in 106.4ms across 10,324 title embeddings via pgvector HNSW
- Show HN: CPU-only transcription for YouTube, TikTok, X, Instagram videos mrkn1 · 52 pts · May 20, 2026 · 64% similar
- Show HN: Online OCR Free – Batch OCR UI for Tesseract, Gemini and OpenRouter naimurhasanrwd · 13 pts · March 03, 2026 · 57% similar
- Show HN: Textile – A desktop app for weaving together bits of text stack_framer · 30 pts · June 01, 2026 · 57% similar
- Show HN: OpenBrief – Local-first video downloader/summarizer tantara · 42 pts · May 25, 2026 · 57% similar
- Show HN: Free, in-browser PDF editor philjohnson · 51 pts · March 28, 2026 · 56% similar
Discussion Highlights (8 comments)
garrett2558
Very cool, I'm building my own local-first product as well
abstract257
Curious how it does on multi-page scanned PDFs vs. single screenshots? The ORT vision/decoder split is the part that usually makes or breaks CPU VLM OCR...
BIGFOOT_EXISTS
Now this is legit cool, keep up the great work.
vivzkestrel
- how well do you think this ll work with code? i mean take code screenshots and convert it into actual code for vscode
kouru225
Roman alphabet only or does this work with other alphabets?
monosma
What was the reason for adopting PaddleOCR? Can other OCR models be used as well?
KetoManx64
What's the performance like compared to tesseract? I don't see tesseract mentioned anywhere in the readme, which is surprising considering that's the number one tool most go to for Image > text OCR.
lavaman131
This is awesome! Been needing something like this for some research paper diagrams I've been indexing.