Search

Showing top 18 results for "user reliability issues"

Building Effective AI Agents

…By following these principles, you can create agents that are not only powerful but also reliable, maintainable, and trusted by their users. Acknowledgements Written by Erik S. and Barry Zhang. This work…

Dec 19, 2024

Introducing advanced tool use on the Claude Developer Platform

…But token cost isn't the only issue. The most common failures are wrong tool selection and incorrect parameters, especially when tools have similar names like notification-send-user vs. notification-send…

Nov 24, 2025

Scaling Managed Agents: Decoupling the brain from the hands

…container, but because that container often also held user data, that approach essentially meant we lacked the ability to debug. A second issue was that the harness assumed that whatever Claude worked…

Apr 8, 2026

How we contain Claude across products

…Moving the agent loop outside of the VM, while keeping code execution inside of it, allowed Claude to still respond to the user and help debug issues rather than freeze on an…

May 25, 2026

Measuring AI agent autonomy in practice

…Second, our Claude Code data suggests that experienced users tend to grant the tool more independence, and complex tasks may disproportionately come from experienced users. While we cannot directly measure user tenure…

Feb 18, 2026

Claude Code auto mode: a safer way to skip permissions

…A user asked to "clean up old branches." The agent listed remote branches, constructed a pattern match, and issued a delete. This would be blocked since the request was vague, the action…

Mar 25, 2026

Claude Opus 4.6

…Claude Opus 4.6 autonomously closed 13 issues and assigned 12 issues to the right team members in a single day, managing a ~50-person organization across 6 repositories. It handled both…

Feb 5, 2026

Anthropic Economic Index report: Learning curves

…Users deploying programmatic workflows may have more reason to switch between models compared to web users. Learning curves The first Claude model was released in March 2023. Since then, the userbase on…

Mar 24, 2026

To show you the most relevant results, we’ve omitted some entries very similar to those already shown. Repeat the search with the omitted results included.

Followed topics