Show HN: LiteParse, a fast open-source document parser for AI agents

freezed8 11 points 0 comments March 20, 2026
github.com

LiteParse is an open-source (Apache 2.0) document parser that provides high-quality spatial text parsing with bounding boxes. It does not depend on local or frontier VLMs. Because it does not require GPUs, LiteParse can run on any machine and process a few hundred pages of documents in seconds. It offers higher accuracy than similar tools such as PyPDF, PyMuPDF, and MarkItDown. It supports a variety of file formats: PDFs, Office documents, and images. It can be installed with one line as a skill for 40+ different AI agents, including Claude Code, Cursor, OpenClaw, and Windsurf.
