Claude does cyber competitions
… At the time the competition started, the Anthropic researcher responsible for launching Claude was busy moving into a new apartment. …
AI is poised to transform the domain of cybersecurity. Anthropic’s Safeguards team recently identified and banned a user with limited coding abilities leveraging Claude to develop malware. Research suggests that this lowering of the bar for expertise needed to pose a threat, combined with the falling costs of large language models (LLMs), presages a dramatic shift in the economics of cyberattacks.[1] To understand the present state of AI cyber capabilities and gain insight into their trajectory, we pursue different approaches to model evaluation, including publicly available and custom-made be
Claude does cyber competitionsAs AI becomes more integrated into the economy, we need more data to better understand its capabilities and limitations. Initiatives like the Anthropic Economic Index provide insight into how individual interactions between users and AI assistants map to economically-relevant tasks. But the economic utility of models is constrained by their ability to perform work continuously for days or weeks without needing human intervention. The need to evaluate this capability led Andon Labs to develop and publish Vending-Bench, a test of AI capabilities in which LLMs run a simulated vending machine busi
Project Vend: Can Claude run a small shop? (And why does that matter?)… At the time the competition started, the Anthropic researcher responsible for launching Claude was busy moving into a new apartment. …
Economic Research Announcing the Anthropic Economic Index Survey Apr 22, 2026 The Economic Research team is launching the Anthropic Economic Index Survey, a monthly survey conducted through Anthropic Interviewer . …
… Claude's advanced capabilities, combined with Anthropic's commitment to safety, are central to our purpose of harnessing AI responsibly, as we drive for transformation in critical areas like fraud prevention & customer service enhancement.” - Rodrigo Castillo, Chief Technology Officer at Commonweal… …
… PwC and Anthropic are building them in production today, across PwC's global network. …
… Anthropic partnered with Andon Labs , an AI safety evaluation company, to have Claude Sonnet 3.7 operate a small, automated store in the Anthropic office in San Francisco. …
… Citation @online{mccrory2026australiacountrybrief, author = {Peter McCrory}, title = {How Australia Uses Claude: Findings from the Anthropic Economic Index}, date = {2026-03-31}, year = {2026}, url = {https://www.anthropic.com/research/australia-brief-economic-index-march-2026}, } Acknowledgements … …
… Four fronts of the competition The US and China are engaged in a competition for strategic advantage in frontier technologies like AI. …
… Donating our open-source alignment tool Focus areas for The Anthropic Institute At The Anthropic Institute TAI , we’ll be using the information we can access from within a frontier lab to investigate AI’s impact on the world, and sharing our learnings with the public. …
… This collaboration with Anthropic augments human expertise to deliver life-changing medicines faster and more efficiently to patients worldwide. We chose Claude, powered by Anthropic, for the strength of its model and its reputation for responsible AI . …
… Donating our open-source alignment tool Focus areas for The Anthropic Institute At The Anthropic Institute TAI , we’ll be using the information we can access from within a frontier lab to investigate AI’s impact on the world, and sharing our learnings with the public. …