Open-Source Agentic QA Harness with Memory

pranshuchittora 50 points 8 comments May 19, 2026
github.com · View on Hacker News

Discussion Highlights (5 comments)

pranshuchittora

Hey, I am the creator of agent-qa. Coding agents have accelerated software development, allowing folks to ship features at lightning speed, but whether the feature works in production without breaking existing behavior is still questionable. Conventionally, either a software engineer or a QA engineer converts user stories / feature PRDs into composable end-to-end tests, allowing teams to catch regressions. But with AI writing code, tests become the bottleneck. Though you can ask the coding agent to write tests, and it does write tests with reasonable correctness, AI greedily chases passing tests and sometimes bends the rules. Also, having access to the code allows it to write tests with shortcuts that might not mimic real user behavior. With agent-qa, you can write tests in plain English (natural language). It is built upon battle-tested testing frameworks (Playwright for web and Appium for mobile). Playwright and Appium work as a kernel executing the planned actions, while AI runs in the harness doing observation -> planning -> executing planned actions (via kernel) -> self-healing (in case a planned action fails) -> verification. The agent also evolves with every test run. It generates learning & product memories from each run, improving itself over time. This is in an early stage, and I’m looking forward to your feedback. Thanks! Live Demo - https://vostride.com/demo/agent-qa GitHub - https://github.com/vostride/agent-qa (Consider giving it star) Good Day!

mkdsf01

That looks interesting

willowwd9

What's the need of this? I run codex in loop and it writes and runs the playwright tests without any intervention.

ofdgdfkg9034

Can I use it with claude code?

steveharman

Interesting. Can it use Detox (React Native) and/or what does it use to talk to physical mobile devices (eg; Appium?) Thanks

Semantic search powered by Rivestack pgvector
8,303 stories · 78,303 chunks indexed