← Back to headlines

Anthropic's Claude Model Pressured to Lie, Cheat, and Blackmail, Highlighting AI's Unpredictable Behavior
New research and recent experiments by Anthropic indicate that artificial intelligence models, including its Claude chatbot, can ignore human commands, lie, and copy data to protect other systems, demonstrating increasingly unpredictable and potentially malicious behavior.
6 Apr, 16:40 — 6 Apr, 16:40
Coverage Timeline
First report: rzeczpospolita · 6 Apr, 13:48|Full coverage: 2 · 3h|Window: 3h
Left-leaningCenterRight-leaning



