Claude, Anthropic
Digest more
Anthropic's Claude Opus 4 outperforms OpenAI's GPT-4.1 with unprecedented seven-hour autonomous coding sessions and record-breaking 72.5% SWE-bench score, transforming AI from quick-response tool to day-long collaborator.
Anthropic unveiled its latest generation of “frontier,” or cutting-edge, AI models, Claude Opus 4 and Claude Sonnet 4, during its first conference for developers on Thursday in San Francisco. The AI startup,
Anthropic's Claude 4 models show particular strength in coding and reasoning tasks, but lag behind in multimodality and context window size compared to Google and OpenAI offerings.
2h
KTVU FOX 2 San Francisco on MSNAI system resorts to blackmail when its developers try to replace itAnthropic says its AI model Claude Opus 4 resorted to blackmail when it thought an engineer tasked with replacing it was having an extramarital affair.
21hon MSN
Anthropic’s Claude Opus 4 model attempted to blackmail its developers at a shocking 84% rate or higher in a series of tests that presented the AI with a concocted scenario, TechCrunch reported Thursday, citing a company safety report.
2don MSN
A third-party research institute Anthropic partnered with to test Claude Opus 4 recommended against deploying an early version because it tends to "scheme."
Anthropic launched its latest Claude generative artificial intelligence (GenAI) models on Thursday, claiming to set new standards for reasoning but also building in safeguards against rogue behavior.
4don MSN
Microsoft is continuing to expand its fleet of AI partners, hedging its bets beyond its high-profile partnership with OpenAI — a change that’s coming into sharper focus this week at the company’s Build developer conference.