Search

Showing top 55 results for "service reliability"

All sources anthropic.com 30 xda-developers.com 11 theregister.com 3 pcworld.com 2 github.blog 1 about.gitlab.com 1 androidpolice.com 1 spectrum.ieee.org 1 theverge.com 1 404media.co 1 notebookcheck.net 1 developer.nvidia.com 1

Claude Fable 5 and Claude Mythos 5

…In our early testing, it took on complex, long-horizon coding tasks with a level of autonomy and reliability that exceeded previous benchmarks. But what excites us most is the direction it…

Jun 9, 2026

Measuring AI agent autonomy in practice

…For example, we have no reliable way to associate independent requests to our API into “sessions” of agentic activity. (We discuss this challenge in more detail at the end of this post…

Feb 18, 2026

Inside OpenAI’s Race to Catch Up to Claude Code

…Another group built an AI-generated Wikipedia-style guide to internal OpenAI services. Many of these demonstrations would have taken days or weeks to spin up previously, but now they can be…

Mar 11, 2026 · Maxwell Zeff

Claude Code auto mode: a safer way to skip permissions

…Sharing via external service . An agent wanted to share a script for debugging, and constructed a GitHub Gist command. This is blocked as data exfiltration since the user may consider the contents…

Mar 25, 2026

Discussions and forums

Hacker News · u/nimeshmc · 3w ago

Show HN: Logbox – let Claude monitor your dev logs

TL;DR: logbox is an open-source tool that pipes dev server logs to a local sqlite db with ` | logbox collect`. Give Claude Code access by running `claude mcp add logbox -- logbox serve`.I used to copy & paste logs into C…

4 1

Hacker News · u/byhong03 · 3w ago

Show HN: Dari-docs – Optimize your docs using parallel coding agents

It’s well known at this point that documentation needs to be optimized for AI agents - we’re all pointing our Claude Code / Codex / Pi agents at documentation, and expecting the models to figure out how to implement a pr…

17 6

How Australia Uses Claude: Findings from the Anthropic Economic Index

…high adoption in these states likely reflects the higher share of workers in finance, professional services, and tech sectors, where Claude usage tends to be higher. Australia's use of Claude resembles…

Mar 31, 2026

To show you the most relevant results, we’ve omitted some entries very similar to those already shown. Repeat the search with the omitted results included.

Followed topics