Search: AI developer practices

Demystifying evals for AI agents

…Bolt, Sierra, Vals.ai, Macroscope, PromptLayer, Stripe, Shopify, the Terminal Bench team, and more. This work reflects the collective efforts of several teams who helped develop the practice of evaluations at Anthropic…

Jan 9, 2026

KPMG integrates Claude across its core business and workforce of more than 276,000 in strategic alliance

…US, including its AI and Data Labs, and the firm's internal teams. As the rollout expands, KPMG and Anthropic will work with shared clients to co-develop new offerings, and modernize…

May 19, 2026

From shortcuts to sabotage: natural emergent misalignment from reward hacking

…We recommend inoculation prompting using language such as that as a practical mitigation that AI developers could adopt to mitigate the risk of reward hacking leading to more dangerous forms of misalignment…

Nov 21, 2025

Building Effective AI Agents

…Agents in practice. Our work with customers has revealed two particularly promising applications for AI agents that demonstrate the practical value of the patterns discussed above. Both applications illustrate how agents add…

Dec 19, 2024

2028: Two scenarios for global AI leadership

…Forum have all publicly condemned the practice of distillation attacks. AI experts in China openly acknowledge distillation attacks’ scale and importance to China’s AI development. A recent article in a state…

May 14, 2026

Claude for Financial Services

…municipal bonds through their 10X Analyst; KPMG helps financial services companies deploy AI assistants and agents to their developers and analysts; PwC breaks down regulations into discrete obligations, analyzes internal compliance gaps…

Jul 15, 2025

Harness design for long-running application development

Written by Prithvi Rajasekaran, a member of our Labs team. Over the past several months I’ve been working on two interconnected…

Mar 24, 2026

Project Fetch: Can Claude train a robot dog?

…monitoring the potential for AI to automate and accelerate the development of future generations of AI. This is one of the capability thresholds included in Anthropic’s Responsible Scaling Policy because of…

Nov 12, 2025

How AI Is Transforming Work at Anthropic

…This includes examining how we bring teams together and collaborate with each other, how we support professional development, and/or how we establish best practices for AI-augmented work (e.g. guided…

Dec 2, 2025

Introducing Claude Opus 4.8

…As we build fiduciary-grade AI systems for legal and tax professionals, advances like these help raise the standard for trusted AI performance in real-world workflows. Claude Opus 4.8 sets…

May 28, 2026
