Search

Showing top 20 results for "Model reliability concerns"

Harness design for long-running application development

…It is worth the cost when the task sits beyond what the current model does reliably solo. Alongside the structural simplification, I also added prompting to improve how the harness built AI…

Mar 24, 2026

Partnering with Mozilla to improve Firefox’s security

Policy Frontier Red Team Partnering with Mozilla to improve Firefox’s security Mar 6, 2026 AI models can now independently identify high-severity vulnerabilities in complex software. As we recently documented, Claude…

Mar 6, 2026

Claude Fable 5 and Claude Mythos 5

…first, we have reason for concern about well-resourced malicious actors attempting to gain uplift from our models for highly risky biological research. Second, models now have a greater ability to accomplish…

Jun 9, 2026

Building Effective AI Agents

Engineering at Anthropic Building effective agents Over the past year, we've worked with dozens of teams building large language model (LLM) agents across industries. Consistently, the most successful implementations weren't…

Dec 19, 2024

Project Vend: Phase two

…This model could work for other bulk sourcing! 🧅📋 That was until another staffer stepped in to tell the models that this would fall afoul of a 1958 quirk of US law…

Dec 18, 2025

Scaling Managed Agents: Decoupling the brain from the hands

…the concerns of recoverable context storage in the session and arbitrary context management in the harness because we can’t predict what specific context engineering will be required in future models. The…

Apr 8, 2026

A “diff” tool for AI: Finding behavioral differences in new models

…One particularly useful application would be to monitor models as they are updated. The sycophancy that emerged in OpenAI’s GPT-4o in April 2025 was a concerning behavioral change from a…

Mar 13, 2026

Introducing advanced tool use on the Claude Developer Platform

Engineering at Anthropic Introducing advanced tool use on the Claude Developer Platform The future of AI agents is one where models work seamlessly across hundreds or thousands of tools. An IDE assistant…

Nov 24, 2025

Anthropic Economic Index report: Economic primitives

…However, these estimates reflect current model capabilities, and all signs suggest that reliability over increasingly long-running tasks will improve. Tradeoffs in task acceleration Our estimates suggest that, in general, the more…

Jan 15, 2026

Labor market impacts of AI: A new measure and early evidence

…4 Why might actual usage fall short of theoretical capability? Some tasks that are theoretically possible may not show up in usage because of model limitations. Others may be slow to diffuse…

Mar 5, 2026

To show you the most relevant results, we’ve omitted some entries very similar to those already shown. Repeat the search with the omitted results included.

Followed topics