Higher usage limits for Claude and a compute deal with SpaceX
…We train and run Claude on a range of AI hardware—AWS Trainium, Google TPUs, and NVIDIA GPUs—and continue to explore opportunities to bring additional capacity online. As part of this…
The core idea is to train Claude to explain its own activations. But how do we know whether an explanation is good? Since we don't know what thoughts an activation actually encodes, we can't directly check whether an explanation is accurate. So we train a second copy of Claude to work backwards—reconstruct the original activation from the text explanation. We consider an explanation to be good if it leads to an accurate reconstruction. We then train Claude to produce better explanations according to this definition using standard AI training techniques. In more detail, suppose we have a langua
Natural Language AutoencodersBefore we started this research, it was not clear where the misaligned behavior was coming from. Our main two hypotheses were: Our post-training process was accidentally encouraging this behavior with misaligned rewards.This behavior was coming from the pre-trained model and our post-training was failing to sufficiently discourage it. We now believe that (2) is largely responsible. Specifically, at the time of Claude 4’s training, the vast majority of our alignment training was standard chat-based Reinforcement Learning from Human Feedback RLHF data that did not include any agentic tool use. T
Teaching Claude why…We train and run Claude on a range of AI hardware—AWS Trainium, Google TPUs, and NVIDIA GPUs—and continue to explore opportunities to bring additional capacity online. As part of this…
…The $100 million we committed in March funds partner training, dedicated technical support, and shared marketing. Firms that join now also get priority access to new certifications as we introduce them. What…
…Don’t experts know more about biology than AI models? LLMs are trained on vast amounts of data ranging from financial models to fanfiction. In the course of this training, they learn…
…As a side effect of this shift, work that used to require years of specialized training can increasingly be done more quickly and cheaply with AI. The rate of progress raises sociological…
…We won’t use this data to train new Claude models, or for any non-safety-related purpose, and we’ve instituted new privacy protections including logging all human access to the…
…As part of this, we plan to develop ways to advance AI education and training within the workforce. Our recent Economic Index data shows that Australians already use Claude for a broader…
Interpretability Signs of introspection in large language models Oct 29, 2025 Read the paper Have you ever asked an AI model what’s on its mind? Or to explain how it came…
…What’s behind these behaviors? The way modern AI models are trained pushes them to act like a character with human-like characteristics. In addition, these models are known to develop rich…
Policy Focus areas for The Anthropic Institute May 7, 2026 At The Anthropic Institute (TAI), we’ll be using the information we can access from within a frontier lab to investigate AI…
…KPMG becomes a preferred consultant for deploying Claude and Anthropic's agents into those portfolio companies—helping them with direct access to Claude to build new AI-driven products, processes, and services…