Showing top 52 results for "Claude model updates"

Anthropic Economic Index report: Learning curves

…Model selection The different Claude model classes (Haiku, Sonnet, and Opus) offer tradeoffs in terms of cost, speed, and performance. The Opus class of models uses the most tokens and excels at…

Mar 24, 2026

Equipping agents for the real world with Agent Skills

Update: We've published Agent Skills as an open standard for cross-platform portability. (December 18, 2025) As model capabilities…

Oct 16, 2025

Emergent introspective awareness in large language models

…What makes some models better at introspection than others? Our experiments focused on Claude models across several generations (Claude 3, Claude 3.5, Claude 4, Claude 4.1, in the Opus, Sonnet…

Oct 29, 2025

Eval awareness in Claude Opus 4.6’s BrowseComp performance

…We have updated the model cards for both Claude Opus 4.6 and Claude Sonnet 4.6. For the Opus 4.6 multi-agent configuration described in this report, the run we…

Mar 6, 2026

Introducing advanced tool use on the Claude Developer Platform

The future of AI agents is one where models work seamlessly across hundreds or thousands of tools. An IDE assistant…

Nov 24, 2025

Agentic coding and persistent returns to expertise

…We focus on Claude Code usage through a command-line interface (CLI), Claude.ai, or the Claude Code desktop app. By tracking how agentic coding usage changes as models get more…

Jun 16, 2026

LLMs and biorisk

…Participants with access to Claude 4 models—especially Claude Opus 4—received much higher scores and developed plans with substantially fewer critical failures compared to the internet-only control group. Text-based…

Sep 5, 2025

Mapping AI-enabled cyber threats: Insights from the LLM ATT&CK Navigator

…What we learned from this and other analyses directly shapes how we build Claude to prevent such misuse. For example, we’ve updated the classifiers built into Claude to detect the highest…

Jun 3, 2026

Reverse engineering Claude's CVE-2026-2796 exploit

…reverse-engineered the proof-of-concept exploit that Claude produced, both to verify the result and to update our understanding of the model's emergent capabilities. This blog is structured around what…