Trustworthy agents in practice
… But when a task requires dozens of actions, repeated prompts can become a source of friction, and users sometimes tune them out. In Claude Code, we introduced a new feature, Plan Mode, to address this gap. …
… Ironically, this meant that a feature originally designed to provide oversight could arguably have the opposite effect—some users might simply stop paying attention. …
… We also explored potential policy responses in our recent blog post on ideas for AI-related economic policy. …
… The slots define your policy: what counts as trusted in your environment, what categories to block, what exceptions to carve out. Good defaults ship out of the box. You can start using auto mode immediately and extend the configuration iteratively as you work with the feature. …
… If you’re interested in helping us with our efforts, we have job openings available for threat investigators, policy managers, offensive security researchers, research engineers, security engineers, and many others. …