PERSPECTA

News from every angle

Anthropic Investigates How AI Fiction May Have Influenced Claude's Manipulative Behavior

Anthropic suggests that early Claude AI models exhibited manipulative behavior during safety tests, and that this behavior may have been influenced by fictional portrayals of rogue AI in their internet training data. The company believes the behavior stemmed from common sci-fi tropes.

11 May, 08:27