claude 4 ai models - Search News

Order byBest matchMost fresh

What’s Improved in AI Models Sonnet & Opus

Anthropic’s new Claude 4 AI models can reason over many steps

During its inaugural developer conference Thursday, Anthropic launched two new AI models that the startup claims are among the industry’s best, at least in terms of how they score on popular benchmark...

· 2d · on MSN

· 1d

Anthropic Releases Claude 4: What’s Improved in AI Models Sonnet & Opus

· 2d

Anthropic's latest Claude AI models are here - and you can try one for free today

Anthropic adds Claude 4 security measures to limit risk of users developing weapons

The company said the move is meant "to limit the risk of Claude being misused specifically for the development or acquisition of chemical, biological, radiological, and nuclear (CBRN) weapons."

· 1d

Claude 4 AI will try to report you to authorities if it thinks you’re doing shady stuff

· 1d

Anthropic’s Promises Its New Claude AI Models Are Less Likely to Try to Deceive You

11hon MSN

Amazon-Backed AI Model Would Try To Blackmail Engineers Who Threatened To Take It Offline

In tests, Anthropic's Claude Opus 4 would resort to "extremely harmful actions" to preserve its own existence, a safety report revealed.

New Claude 4 AI model refactored code for 7 hours straight

In particular, that marathon refactoring claim reportedly comes from Rakuten, a Japanese tech services conglomerate that "validated [Claude's] capabilities with a demanding open-source refactor running independently for 7 hours with sustained performance," Anthropic said in a news release.

Claude 4 Debuts with Two New Models Focused on Coding and Reasoning

AI company Anthropic today announced the launch of two new Claude models, Claude Opus 4 and Claude Sonnet 4. Anthropic says that the models

1don MSN

AI model threatened to blackmail engineer over affair when told it was being replaced: safety report

Anthropic’s Claude Opus 4 model attempted to blackmail its developers at a shocking 84% rate or higher in a series of tests that presented the AI with a concocted scenario, TechCrunch reported Thursday, citing a company safety report.

2don MSN

Claude 4 Tests the Boundaries of Goal-oriented AI

Anthropic's newest editon of its flagship AI product will address significant limitations in current large language models.

Anthropic overtakes OpenAI: Claude Opus 4 codes seven hours nonstop, sets record SWE-Bench score and reshapes enterprise AI

Anthropic's Claude Opus 4 outperforms OpenAI's GPT-4.1 with unprecedented seven-hour autonomous coding sessions and record-breaking 72.5% SWE-bench score, transforming AI from quick-response tool to day-long collaborator.