Show HN: Black-box API bug detection across 7 AI systems
riyajoshi
11 points
4 comments
June 04, 2026
Related Discussions
Found 5 related stories in 103.2ms across 10,002 title embeddings via pgvector HNSW
- Show HN: Open-source playground to red-team AI agents with exploits published zachdotai · 21 pts · March 15, 2026 · 59% similar
- Show HN: Output.ai - OSS framework we extracted from 500+ production AI agents bnchrch · 39 pts · April 07, 2026 · 58% similar
- Show HN: Nyx – multi-turn, adaptive, offensive testing harness for AI agents zachdotai · 19 pts · April 19, 2026 · 58% similar
- Show HN: Cq – Stack Overflow for AI coding agents peteski22 · 108 pts · March 23, 2026 · 57% similar
- Show HN: PhAIL – Real-robot benchmark for AI models vertix · 20 pts · March 31, 2026 · 57% similar
Discussion Highlights (3 comments)
saikia_
Cool launch - let me try this with our in house setup!
calderon_1903
interesting stuff
naveenprasanthv
Kind of wild that we're finally getting benchmarks for AI-generated API testing. Feels like the equivalent of SWE-bench, but for finding actual bugs instead of writing code.