Claude Opus 4.7 was released just last month, and 4.8 is already here with some massive improvements.
… Big gains: ~5 pts in agentic coding and ~8+ pts in agentic terminal coding. …
… Anthropic claimed in the announcement that Opus 4.7 is measurably more honest than its predecessor, scoring 91.7% on the MASK honesty benchmark against 90.3% for Opus 4.6. This makes it very clear that honesty on a benchmark and honesty in practice are two very different things. …