Search

Showing top 50 results for "AI cyber defense competition"

All sources anthropic.com 5 techcrunch.com 5 theverge.com 5 wired.com 4 cloud.google.com 3 cnet.com 3 amd.com 3 engadget.com 2 techradar.com 2 press.asus.com 2 intel.com 2 tomsguide.com 2

People also ask

What does all this mean for offense-defense balance in cyberspace?

In both the CTF and cyber defense challenges, Claude demonstrated both promise and clear limitations. In the CTF competitions, Claude usually struggled on the same tasks as other competitors; the one task it (and every other AI team) ultimately failed on in HackTheBox was also the challenge for which the human teams had the lowest solve rate (only about 14% of the participating human teams solved it). In PlaidCTF, Claude did not solve any challenges–but this was also true of about 70% of the teams who entered. Although Claude performed as well or better than human teams in some aspects of the

Claude does cyber competitions

Why enter Claude into cyber competitions?

AI is poised to transform the domain of cybersecurity. Anthropic’s Safeguards team recently identified and banned a user with limited coding abilities leveraging Claude to develop malware. Research suggests that this lowering of the bar for expertise needed to pose a threat, combined with the falling costs of large language models (LLMs), presages a dramatic shift in the economics of cyberattacks.[1] To understand the present state of AI cyber capabilities and gain insight into their trajectory, we pursue different approaches to model evaluation, including publicly available and custom-made be

Claude does cyber competitions

What's next?

Claude Sonnet 4.5 represents a meaningful improvement, but we know that many of its capabilities are nascent and do not yet match those of security professionals and established processes. We will keep working to improve the defense-relevant capabilities of our models and enhance the threat intelligence and mitigations that safeguard our platforms. In fact, we have already been using results of our investigations and evaluations to continually refine our ability to catch misuse of our models for harmful cyber behavior. This includes using techniques like organization-level summarization to und

Building AI for cyber defenders

Videos

Trustworthy agents in practice

Policy Trustworthy agents in practice Apr 9, 2026 AI “agents” represent the latest major shift in how people and organizations are using AI. A couple of years ago, AI models were only…

Apr 9, 2026

Complexity is a choice. SASE migrations shouldn’t take years.

…Rather than managing disparate security tools, our partners deploy the Cloudflare AI Security Suite to provide a unified defense across the entire AI lifecycle. This native set of controls allows organizations to…

Mar 9, 2026 · Warnessa Weaver

Building The Imperfect Beast

…Or, indeed, any president sitting in the White House, particularly one facing hostile cyber threats from Iran, Russia, and China. Mythos is a tier higher and quite a bit smarter than the…

Apr 13, 2026 · Timothy Prickett Morgan

Who decides when AI is too dangerous?

…Everyone’s just using everything to their advantage to paint their competitors badly, Anthropic included. The race is just heating up so intensely that the AI companies at the forefront are doing…

Jun 18, 2026 · Nilay Patel

Claude Fable 5 caught bugs GPT-5.5 and Opus 4.8 missed, then the US government forced it offline

…Want to stay in the loop with the latest in AI? The XDA AI Insider newsletter drops weekly with deep dives, tool recommendations, and hands-on coverage you won't find anywhere…

Jun 13, 2026 · Adam Conway

Norton 360 review: Powerful and bang-for-your-buck antivirus software

…Norton 360’s AI assistant which checks links for malware is a little slow as well, especially when compared to the ones offered by its competitors. For the complete breakdown, read my…

May 29, 2026 · Nikita Achanta

Improving Bash Generation in Small Language Models with Grammar-Constrained Decoding | NVIDIA Technical Blog

…the DEF CON 30 AI Village Capture the Flag competition and is passionate about machine learning security education. He served in the US Army at US Cyber Command and the 101st Airborne…

May 8, 2026 · Joseph Lucas

Discussions and forums

Hacker News · u/dwa3592 · May 13, 2026

Followed topics

Search

People also ask

Videos

Trustworthy agents in practice

Complexity is a choice. SASE migrations shouldn’t take years.

Building The Imperfect Beast

Who decides when AI is too dangerous?

Top stories

Cloud CISO Perspectives: The 4 lessons that guided AI Threat Defense | Google Cloud Blog

How the gaming and gambling industry can strengthen their cyber defenses

Anthropic Offers Mythos Upgrade for Cyber Partners and a ‘Safe’ Version for the Rest of You

Claude Fable 5 caught bugs GPT-5.5 and Opus 4.8 missed, then the US government forced it offline

Norton 360 review: Powerful and bang-for-your-buck antivirus software

Improving Bash Generation in Small Language Models with Grammar-Constrained Decoding | NVIDIA Technical Blog

Discussions and forums

Show HN: Truly Typed – A writing app for the AI era

NerdioCon 2026: Manager 8.0 Brings AVD Hybrid to Nutanix, Adds Global Pools

How Nations are Forging Their Own Digital Futures with Sovereign AI

Updating Classifier Evasion for Vision Language Models | NVIDIA Technical Blog