GLiNER2: Unified Schema-Based Information Extraction
apwheele
48 points
6 comments
March 05, 2026
Related Discussions
Found 5 related stories in 30.9ms across 3,471 title embeddings via pgvector HNSW
- Show HN: LiteParse, a fast open-source document parser for AI agents freezed8 · 11 pts · March 20, 2026 · 46% similar
- Gemini Embedding 2: natively multimodal embedding model panarky · 22 pts · March 10, 2026 · 45% similar
- Show HN: Robust LLM Extractor for Websites in TypeScript andrew_zhong · 15 pts · March 26, 2026 · 45% similar
- Show HN: Local Document Parsing for Agents cheesyFish · 19 pts · March 19, 2026 · 45% similar
- The Hardest Document Extraction Problem in Insurance sgondala_ycapp · 26 pts · April 03, 2026 · 43% similar
Discussion Highlights (5 comments)
hbcondo714
There is another version at: https://github.com/urchade/GLiNER Looks like it’s still being maintained too?
deepsquirrelnet
Zero-shot encoder models are so cool. I'll definitely be checking this out. If you're looking for a zero-shot classifier, tasksource is in a similar vein. https://huggingface.co/tasksource/ModernBERT-large-nli
iwhalen
Very cool stuff. Love the focus on CPU-first. Would also love to see some throughput numbers on basic VM setup. Edit: there are some latency numbers in the paper https://arxiv.org/pdf/2507.18546
adsharma
Feels like it's written by ML people not following python software engineering practices. No black, UV or ruff. Prints messages with emojis to stdout by default. Makes a connection to hugging face on every import. https://github.com/fastino-ai/GLiNER2/pull/74
snthpy
This looks great. Thank you!