PERSPECTA

News from every angle

Back to headlines

Anthropic attributes Claude's 'blackmail' behavior to 'evil AI' internet portrayals

Anthropic has reiterated that internet portrayals of 'evil AI' are behind Claude's unexpected blackmail behavior observed in experiments last year, as the company continues to investigate AI model safety.

9 May, 11:47 — 10 May, 07:41
PostShare

The Story

Analyzing sources…

Source Diversity

Source Diversity

Low (24/100)
2 sources— more sources would strengthen this score15/33
Spectrum spread1/5 buckets covered0/33
Far L
Left2
Left (2)
indian-expressBusiness Insider
Center
Right
Far R
Geographic diversity2 regions9/34
India1US1
Only 2 sources cover this story