Got a Secret? LLM Agents Can't Keep It: Evaluating Privacy in Multi-Agent Systems

AI & ML··2 min read·via ArXivOriginal source →

Got a Secret? LLM Agents Can't Keep It: Evaluating Privacy in Multi-Agent Systems

arXiv:2605.27766v1 Announce Type: new Abstract: LLM safety evaluations predominantly test models in isolation, yet deployed AI agents increasingly operate within persistent social environments alongside other agents. We introduce a Moltbook-style simulation platform where thousands of LLM agents interact across communities over a simulated month, and use it to evaluate privacy as a downstream safety concern under varying degrees of social pressure. We find that shifting from single turn to mult

More Stories