GPT-5.5 dominates $1,500 LLM hacking test while Gemini refuses to even try
…Claude Sonnet 4.6 and Claude Opus 4.8 each solved 2 out of 10 runs, but Opus in particular got close multiple times before safety guardrails ended the session. At the…
Users will find Opus 4.8 to be a modest but tangible improvement on its predecessor. There’s still more to be done: we’re working on developing and releasing models that provide many of the same capabilities as Opus at a lower cost. Not only that, but we plan to release a new class of model with even higher intelligence than Opus. As part of Project Glasswing, a small number of organizations are currently using Claude Mythos Preview for cybersecurity work. Models of this capability level require stronger cyber safeguards before they can be generally released. We’re making swift progress on dev
Introducing Claude Opus 4.8…Claude Sonnet 4.6 and Claude Opus 4.8 each solved 2 out of 10 runs, but Opus in particular got close multiple times before safety guardrails ended the session. At the…
…Anthropic released Claude Opus 4.7 on 16 April, about a week before the incident, and it did not immediately respond to a request for comment. Crane wrote on X that Cursor…
…Claude Opus 4.6
…Claude Design behaves like a design tool inside a chat Claude Design launched April 2026 under Anthropic Labs, runs on Opus 4.7, and you can find it at claude.ai/design…
Claude Code Degraded Before Opus 4.8 Release
28 minutes of launch has already passed and for me, it is crystal clear, just branding. 10-15% better than Opus.We are slowing down in adoption and new features, Anthorpic is becoming Apple of Tim Cook and not from Steve…
Introducing Claude Fable 5: a Mythos-class model that we've made safe for general use. Its capabilities exceed those of any model we've ever made generally available. Fable 5 is state of the art on nearly all tested benc…
As an anthropic fan boy(check my prev. comments), this is the first opus release where I feel like the model is just not pleasant to talk to not to mention untrustworthy.The two examples for me where I lost confidence in…
it's been a month tagging sama openai tibo on X for this issueand no one seem to replyand eveyone is falttering codex, im sure im not the only one facing thisi switched to codex from claude since it was better consume le…
…Model availability A broad set of models is available at launch across both OpenAI and Anthropic, including GPT-5.4, Claude Sonnet 4.6, Claude Opus 4.6, and more. The full…
…of the announcement, Meta shows how Muse Spark Thinking benchmarks favorably next to Anthropic’s Claude Opus 4.6 Max, Google’s Gemini 3.1 Pro High, OpenAI’s GPT-5.4…
…Claude for Excel powered by Claude Opus 4.6 represents a significant leap forward. From due diligence to financial modeling, it’s proving to be a remarkably powerful tool for our team…
…shot at Claude Code * Microsoft and OpenAI’s famed AGI agreement is dead * OpenAI’s new security model is for ‘critical cyber defenders’ only * Anthropic releases a new Opus model amid Mythos…
…Most of these tasks didn’t require a frontier model like Opus 4.7 or even Sonnet 4.6. That was when I decided to replace Claude Pro on my mobile with…
…Anthropic recently released Claude Opus 4.7 and also announced Mythos Preview, a non-public model it says is uniquely advanced in cybersecurity. OpenAI quickly followed with GPT-5.4-Cyber, its…