Search

Showing top 135 results for "Claude Opus 4.8 update"

Videos

GPT-5.5 dominates $1,500 LLM hacking test while Gemini refuses to even try

…Claude Sonnet 4.6 and Claude Opus 4.8 each solved 2 out of 10 runs, but Opus in particular got close multiple times before safety guardrails ended the session. At the…

Jun 4, 2026 · Anubhav Sharma

Paper page - Harness Updating Is Not Harness Benefit: Disentangling Evolution Capabilities in Self-Evolving LLM Agents

…models from different capability tiers produce harness updates that lead to surprisingly similar gains; even Qwen3.5-9B's updates yield gains comparable to those of Claude Opus~4.6. Second, harness…

Jun 1, 2026

I Vibe-Coded a Portfolio for My Resume With Claude. It Was Stunning but Riddled With Errors

…I also went in and selected the latest Claude model, which is Opus 4.7. I clicked on the icon on the far left of the chat box. You can drop files…

May 6, 2026 · See full bio

Agents for financial services

…These updates pair best with Claude Opus 4.7, which is state-of-the-art on financial tasks and leads the industry on Vals AI's Finance Agent benchmark , at 64.37…

May 5, 2026

Discussions and forums

Hacker News · u/davidvgilmore · 1w ago

Show HN: Rayline routes Claude Code subagents to on-device and cheaper models

Hi HN,I’m one of the builders of Rayline.Rayline is a Claude Code compatible LLM gateway. It intercepts and overrides claude code’s internal routing and lets you route subagent calls to different models instead. For exam…

10 8

Hacker News · u/mesmertech · 2w ago

Ask HN: Anyone else seeing serious degradation in DX with Opus 4.8?

As an anthropic fan boy(check my prev. comments), this is the first opus release where I feel like the model is just not pleasant to talk to not to mention untrustworthy.The two examples for me where I lost confidence in…

Hacker News · u/adamthegoalie · May 11, 2026

Show HN: adamsreview – better multi-agent PR reviews for Claude Code

I built adamsreview, a Claude Code plugin that runs deeper, multi-stage PR reviews using parallel sub-agents, validation passes, persistent JSON state, and optional ensemble review via Codex CLI and PR bot comments.On my…

85 55

Hacker News · u/abi · May 14, 2026

Show HN: 1-800-CODER, macOS app where you call an AI developer to edit your page

Sharing a small Mac app I built around OpenAI’s gpt-realtime-2 model. You call up a voice coding agent and talk to it like you’d talk to a freelancer ("make the hero tighter, put a product image on the right, that one's …

Hacker News · u/sminchev · May 18, 2026

I created a 126K line Android app with AI – the workflow that worked for me

I really wanted to see how far I can go. Can I create a meaningful and complex application, big enough, but without knowing the language.I have 18+ years of experience as software developer. But I have no experience with…

Anthropic pulls Claude Mythos 5 and Claude Fable 5 following US government directive - 9to5Mac

…model or Opus 4.8, and existing Fable 5 sessions will end with an error. On the Claude Platform, requests to Fable 5 will also return an error. Please update your integrations…

Jun 13, 2026 · Marcus Mendes

Claude Code has become dumber, lazier: AMD director

AI + ML AMD's AI director slams Claude Code for becoming dumber and lazier since last update 'Claude cannot be trusted to perform complex engineering tasks' according to GitHub ticket If you…

Apr 6, 2026 · Brandon Vigliarolo

Teaching Claude why

…Thus, after Claude 4, it was clear we needed to improve our safety training and, since then, we have made significant updates to our safety training. We use agentic misalignment as a…

May 8, 2026

Mythos-class Claude Fable 5 arrives on GitLab Duo Agent Platform

…For developers, platform engineers, and engineering leaders, this is not an incremental model update. Claude Fable 5 completes multi-step, goal-directed work that previous models could not sustain, and it does…

Jun 9, 2026 · Talia Armato-Helle

What an AI-designed car looks like

…started: * The AI-designed car is taking shape * OpenAI’s big Codex update is a direct shot at Claude Code * Microsoft and OpenAI’s famed AGI agreement is dead * OpenAI’s new…

May 5, 2026 · David Pierce

Anthropic blames dystopian sci-fi for training AI models to act “evil”

…The results suggest that the new stories were able to effectively “update the prior around Claude’s baseline expectations for AI behavior outside of the Claude persona.” The researchers theorize that this…

May 13, 2026 · Kyle Orland

Followed topics