What’s Improved in AI Models Sonnet & Opus
Digest more
Anthropic's most powerful model yet, Claude 4, has unwanted side effects: The AI can report you to authorities and the press.
6hon MSN
Anthropic’s Claude Opus 4 model attempted to blackmail its developers at a shocking 84% rate or higher in a series of tests that presented the AI with a concocted scenario, TechCrunch reported Thursday, citing a company safety report.
Claude 3.7 Sonnet is about to get a big thinking upgrade, as the AI will be able to go back to reasoning when the task demands it.
Anthropic's Claude AI tried to blackmail engineers during safety tests, threatening to expose personal info if shut down
The testing found the AI was capable of "extreme actions" if it thought its "self-preservation" was threatened.
Anthropic, a leading AI startup, is reversing its ban on using AI for job applications, after a Business Insider report on the policy.