Search

Showing top 22 results for "AI safety & policy"

2028: Two scenarios for global AI leadership

… While increasing numbers of researchers in China’s AI labs and policy community are concerned with AI safety risks, this trend has not translated into safety practices on par with labs in the US. …

May 14, 2026

Developing Nuclear Safeguards for AI

… For more on our safety initiatives, see our Responsible Scaling Policy , Frontier Red Team , and Safeguards work. …

Aug 21, 2025

Introducing The Anthropic Institute

… Public Policy focuses on the areas where Anthropic has defined priorities and perspectives, including model safety and transparency , energy ratepayer protections , infrastructure investments , export controls , and democratic leadership in AI . …

Mar 11, 2026

Core views on AI safety: When, why, what, and how

… Some of the key areas of active work include improving our understanding of how AI systems learn and generalize to the real world, developing techniques for scalable oversight and review of AI systems, creating AI systems that are transparent and interpretable, training AI systems to follow safe pr… …

Mar 8, 2023

Advancing Claude in healthcare and the life sciences

… A selection of our partners describe their experiences using Claude below: We were drawn to Anthropic's focus on AI safety and Claude's Constitutional AI approach to creating more helpful, harmless, and honest AI systems. …

Jan 11, 2026

Results from first Anthropic Public Record

… We recently announced several policy frameworks relevant to these findings. Our Advanced AI Framework proposes mandatory independent safety testing for frontier models, transparency requirements, and government authority to block or recall dangerous AI deployments. …

Jun 12, 2026

The Long-Term Benefit Trust

… In December 2023, Jason Matheny stepped down from the Trust to preempt any potential conflicts of interest that might arise with RAND Corporation's policy-related initiatives. Paul Christiano stepped down in April 2024 to take a new role as the Head of AI Safety at the U.S. AI Safety Institute . …

Sep 19, 2023

Introducing Claude Opus 4.5

… Let me think about what options I have within my policy: 1. Modify flights - Basic economy cannot be modified. This is clear in the policy. 2. Change cabin - Wait, let me check this option! …

Nov 24, 2025

Introducing Sonnet 4.6

… MMMU-Pro : We made two small updates to our MMMU-Pro implementation that have affected the score: 1 our previous implementation contained the prefix “Let’s think step-by-step,” which we have removed, and 2 we previously graded this multiple-choice eval by looking at on-policy token probabilities of… …

Feb 17, 2026

Focus areas for The Anthropic Institute

… Our agenda focuses on four areas for research: Economic diffusion Threats and resilience AI systems in the wild AI-driven R&D In Core Views on AI Safety , we wrote that doing effective safety research required close contact with frontier AI systems. …

May 7, 2026

Followed topics