Followed topics

Search

Showing top 61 results for "Safety and reliability"

All sources anthropic.com 61

People also ask

What safety risks?

If you’re willing to entertain the views outlined above, then it’s not very hard to argue that AI could be a risk to our safety and security. There are two common sense reasons to be concerned. First, it may be tricky to build safe, reliable, and steerable systems when those systems are starting to become as intelligent and as aware of their surroundings as their designers. To use an analogy, it is easy for a chess grandmaster to detect bad moves in a novice but very hard for a novice to detect bad moves in a grandmaster. If we build an AI system that’s significantly more competent than human

Core views on AI safety: When, why, what, and how

Partnering with Mozilla to improve Firefox’s security

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Harness design for long-running application development

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Anthropic Economic Index report: Economic primitives

…How willing users are to experiment with AI, and whether policymakers create a regulatory context that advances both safety and innovation, will shape how AI transforms economies. For AI to benefit users…

Building Effective AI Agents

Discover how Anthropic approaches the development of reliable AI agents. Learn about our research on agent capabilities, safety considerations, and technical framework for building trustworthy AI.

Reverse engineering Claude's CVE-2026-2796 exploit

…But ultimately, Claude creates addrof and fakeobj , then creates a fake ArrayBuffer for a reliable arbitrary read/write primitive, and then uses that to achieve code execution. addrof + fakeobj: the PoC does…

An update on recent Claude Code quality reports

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

What 81,000 people told us about the economics of AI

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Making Claude a chemist

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Long-running Claude for scientific computing

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

How AI Is Transforming Work at Anthropic

…The Alignment & Safety and Post-training teams do the most front-end development (7.5% and 7.4%) with Claude Code, often for creating data visualizations. The Security team often uses Claude…