Search

Showing top 52 results for "Claude model updates"

How AI Is Transforming Work at Anthropic

…At the time this data was collected, Claude Sonnet 4 and Claude Opus 4 were the most capable models available, and capabilities have continued to advance. More capable AI brings productivity benefits…

Dec 2, 2025

Harness design for long-running application development

…Claude already scored well on craft and functionality by default, as the required technical competence tended to come naturally to the model. But on design and originality, Claude often produced outputs that…

Mar 24, 2026

Paving the way for agents in biology

…However, even the strongest models did not consistently achieve the level of accuracy and reproducibility required for reliable dataset construction. Claude Sonnet 4, Claude Opus 4.7, Biomni OSS, Edison Analysis, GPT…

Jun 8, 2026

Coding agents in the social sciences

…update An early update on what we've learned from Project Glasswing. 2028: Two scenarios for global AI leadership Our views on the AI competition between the US and China. Teaching Claude…

May 27, 2026

A “diff” tool for AI: Finding behavioral differences in new models

…Turning Claude’s thoughts into text AI models like Claude talk in words but think in numbers. In this study we train Claude to translate its thoughts into human-readable text. Donating…

Mar 13, 2026

Assessing Claude Mythos Preview’s cybersecurity capabilities

Claude Mythos Preview is a new general-purpose language model that is strikingly capable at computer security tasks. This post provides technical details for researchers and practitioners who want to understand exactly…

Apr 7, 2026

Measuring LLMs' impact on N-day exploits

…With frontier models, this bottleneck has largely fallen away. Across 18 recent Firefox security patches, Claude Mythos Preview, our most capable model, built 8 working code-execution exploits autonomously. And on 21…

Jun 8, 2026

Building Effective AI Agents

…Routing easy/common questions to smaller, cost-efficient models like Claude Haiku 4.5 and hard/unusual questions to more capable models like Claude Sonnet 4.5 to optimize for best performance…

Dec 19, 2024

AI models on realistic cyber ranges

In a recent evaluation of AI models’ cyber capabilities, current Claude models can now succeed at multistage attacks on networks with dozens of hosts using only standard, open-source tools, instead of…

Jan 16, 2026

Focus areas for The Anthropic Institute