Agents of Chaos

pagade 13 points 3 comments March 07, 2026
arxiv.org · View on Hacker News

Discussion Highlights (2 comments)

cs702

TL;DR: The authors found current-generation AI agents are too unreliable, too untrustworthy, and too unsafe for real-world use. Quoting from the abstract: "We report an exploratory red-teaming study of autonomous language-model–powered agents deployed in a live laboratory environment with persistent memory, email accounts, Discord access, file systems, and shell execution. Over a two-week period, twenty AI researchers interacted with the agents under benign and adversarial conditions." "Observed behaviors include unauthorized compliance with non-owners, disclosure of sensitive information, execution of destructive system-level actions, denial-of-service conditions, uncontrolled resource consumption, identity spoofing vulnerabilities, cross-agent propagation of unsafe practices, and partial system takeover."

Muhammad523

One good reason not to use OpenClaw and the likes.

Semantic search powered by Rivestack pgvector
3,471 stories · 32,344 chunks indexed