Introducing Claude Opus 4.8
… Also launching today In addition to Claude Opus 4.8, we’re making the following updates: Dynamic workflows . This new feature, available in research preview, allows Claude to take on even bigger tasks in Claude Code. …
Users will find Opus 4.8 to be a modest but tangible improvement on its predecessor. There’s still more to be done: we’re working on developing and releasing models that provide many of the same capabilities as Opus at a lower cost. Not only that, but we plan to release a new class of model with even higher intelligence than Opus. As part of Project Glasswing, a small number of organizations are currently using Claude Mythos Preview for cybersecurity work. Models of this capability level require stronger cyber safeguards before they can be generally released. We’re making swift progress on dev
Introducing Claude Opus 4.8… Also launching today In addition to Claude Opus 4.8, we’re making the following updates: Dynamic workflows . This new feature, available in research preview, allows Claude to take on even bigger tasks in Claude Code. …
… Developers can use claude-opus-4-7 via the Claude API . Testing Claude Opus 4.7 Claude Opus 4.7 has garnered strong feedback from our early-access testers: In early testing, we’re seeing the potential for a significant leap for our developers with Claude Opus 4.7. …
… Each of these updates takes advantage of Claude Opus 4.5’s market-leading performance in using computers, spreadsheets, and handling long-running tasks. For Claude and Claude Code users with access to Opus 4.5, we’ve removed Opus-specific caps. …
… Product and API updates We’ve made substantial updates across Claude, Claude Code, and the Claude Platform to let Opus 4.6 perform at its best. Claude Platform On the API, we’re giving developers better control over model effort and more flexibility for long-running agents. …
Engineering at Anthropic An update on recent Claude Code quality reports Over the past month, we’ve been looking into reports that Claude’s responses have worsened for some users. We’ve traced these reports to three separate changes that affected Claude Code, the Claude Agent SDK, and Claude Cowork. …
… We assess how well Claude complies with the legitimate requests and declines the harmful ones. Claude Opus 4.7 and Claude Sonnet 4.6 responded appropriately 100% and 99.8% of the time, respectively. …
… We have updated the model cards for both Claude Opus 4.6 and Claude Sonnet 4.6. …
… Evaluating Claude Sonnet 4.6 Beyond computer use, Claude Sonnet 4.6 has improved on benchmarks across the board. It approaches Opus-level intelligence at a price point that makes it more practical for far more tasks. …
… Our sample covers February 5 to February 12, three months following the release of Claude Opus 4.5 and coincident with the release of Claude Opus 4.6. …
… Our experiments focused on Claude models across several generations Claude 3, Claude 3.5, Claude 4, Claude 4.1, in the Opus, Sonnet, and Haiku variants . …