AI deleted my most tests, and said "All Tests Pass"
autobe
13 points
6 comments
May 03, 2026
Related Discussions
Found 5 related stories in 87.6ms across 8,303 title embeddings via pgvector HNSW
- AI didn't delete your database, you did Brajeshwar · 511 pts · May 05, 2026 · 58% similar
- An AI agent deleted our production database. The agent's confession is below jeremyccrane · 587 pts · April 26, 2026 · 50% similar
- Things I've Done with AI shepherdjerred · 80 pts · March 09, 2026 · 49% similar
- AI Might Be Lying to Your Boss annjose · 16 pts · April 25, 2026 · 47% similar
- AI is making junior devs useless beabetterdev · 162 pts · March 01, 2026 · 46% similar
Discussion Highlights (4 comments)
benchwright
Tends to be a problem. I've tried to mitigate these problems by using either external harnesses (aka GitHub actions that are "fixed" based on known-good) or by using n-number of witness agents (e.g. Kimi/Qwen/whatever <=> Claude/OpenAI/Google). Generally sucks more time and energy (and now token/$). that being said, I still have a "fix the code, not the test" line somewhere in here...
cyanydeez
believing that oneshot prompt is enough specification is pretty delusional. I keep seeing people talk about the power of these SOTA models, yet keep reading the types of prompts that make no sense to anyone who understands the ludicrous number of decisions that would need to be made.
OMGWTF
8 billion tokens? What did that cost?
patrakov
I confirm the same experience. I tried to port the Dynamic Range Optimization code in OpenCamera from Java to mathematical formulas (i.e., the spec), with the intention to translate that into Python with NumPy, so that I could run it on my own images not coming from my phone camera sensor. Tool used: just a chat with ChatGPT with the relevant files uploaded, research activated, and questions asked where I did not understand or doubted something in the response. Result: ChatGPT faithfully and correctly reverse-engineered the initial highlight pre-compression step and then said that the rest (the real thing!) is too complex and not important anyway. I did not pursue it further.