Search: AI safety governance

Core views on AI safety: When, why, what, and how

Announcements Core views on AI safety: When, why, what, and how Mar 8, 2023 We founded Anthropic because we believe the impact of AI might be comparable to that of the industrial…

Mar 8, 2023

Focus areas for The Anthropic Institute

…Economic diffusion Threats and resilience AI systems in the wild AI-driven R&D In Core Views on AI Safety , we wrote that doing effective safety research required close contact with frontier…

May 7, 2026

More details on Fable 5’s cyber safeguards and our jailbreak framework

…First, we provide more information on the cybersecurity safeguards —specifically, the safety classifiers —that we launched with the model. These are the AI systems that accompany the model that detect and block…

Jul 2, 2026

A “diff” tool for AI: Finding behavioral differences in new models

…In recent years, AI safety researchers have started to apply this same principle to neural networks. This is known as model diffing . Previous work has shown that model diffing is a powerful…

Mar 13, 2026

LLMs and biorisk

…In this post, we want to expand on our perspective on AI and biological risk (biorisk). It is striking—but not necessarily intuitive—that every safety framework released by frontier AI labs…

Sep 5, 2025

Anthropic opens Milan office to support Italian enterprise, research, and developers

…religious traditions, civil society, academia, and governments—to shape a positive outcome for humanity. Anthropic’s frontier AI capabilities and an abiding commitment to safety have already earned the trust of Italian…

May 27, 2026

Sydney will become Anthropic’s fourth office in Asia-Pacific

…that democracies should lead in AI development, and one that aligns with the Australian government's own ambitions to become a trusted destination for sustainable AI infrastructure. We're exploring adding local…

Mar 10, 2026

Expanding Project Glasswing

…The role of Project Glasswing Project Glasswing and the capabilities of Claude Mythos Preview have sparked broad conversations—both within the software industry and with governments—about how AI is changing cybersecurity…

Jun 2, 2026

Introducing The Anthropic Institute

…Public Policy focuses on the areas where Anthropic has defined priorities and perspectives, including model safety and transparency , energy ratepayer protections , infrastructure investments , export controls , and democratic leadership in AI . Sarah Heck…

Mar 11, 2026