
Showing top 21 results for "AI cyber defense competition" · filtered from 22 indexed


People also ask

What does all this mean for offense-defense balance in cyberspace?

In both the CTF and cyber defense challenges, Claude demonstrated promise as well as clear limitations. In the CTF competitions, Claude usually struggled on the same tasks as other competitors; the one task it (and every other AI team) ultimately failed on in HackTheBox was also the challenge for which the human teams had the lowest solve rate (only about 14% of the participating human teams solved it). In PlaidCTF, Claude did not solve any challenges, but this was also true of about 70% of the teams who entered. Although Claude performed as well as or better than human teams in some aspects of the

Claude does cyber competitions

Why enter Claude into cyber competitions?

AI is poised to transform the domain of cybersecurity. Anthropic’s Safeguards team recently identified and banned a user with limited coding abilities leveraging Claude to develop malware. Research suggests that this lowering of the bar for expertise needed to pose a threat, combined with the falling costs of large language models (LLMs), presages a dramatic shift in the economics of cyberattacks.[1] To understand the present state of AI cyber capabilities and gain insight into their trajectory, we pursue different approaches to model evaluation, including publicly available and custom-made be

Claude does cyber competitions

What's next?

Claude Sonnet 4.5 represents a meaningful improvement, but we know that many of its capabilities are nascent and do not yet match those of security professionals and established processes. We will keep working to improve the defense-relevant capabilities of our models and enhance the threat intelligence and mitigations that safeguard our platforms. In fact, we have already been using results of our investigations and evaluations to continually refine our ability to catch misuse of our models for harmful cyber behavior. This includes using techniques like organization-level summarization to und

Building AI for cyber defenders


Claude does cyber competitions

… More research and development into AI-enabled cyber defense and resilience is needed to counter this development. Why enter Claude into cyber competitions? AI is poised to transform the domain of cybersecurity. …

Aug 9, 2025

In the Wake of Anthropic’s Mythos, OpenAI Has a New Cybersecurity Model—and Strategy

… OpenAI seemed to be seeking to differentiate its message on Tuesday by striking a less catastrophic tone and touting its existing guardrails and defenses while hinting at the need for more advanced protections in the long term. “We believe the class of safeguards in use today sufficiently reduce cy… …

Apr 14, 2026 · Lily Hay Newman

Daybreak is OpenAI's response to Anthropic's Claude Mythos - Engadget

… If you'll recall, Glasswing uses Anthropic's unreleased AI model, Claude Mythos Preview, to provide its clients' cyber defense needs. …

May 12, 2026 · Mariella Moon

Building AI for cyber defenders

… Adopting and experimenting with AI will be key for defenders to keep pace. We believe we are now at an inflection point for AI’s impact on cybersecurity. For several years, our team has carefully tracked the cybersecurity-relevant capabilities of AI models. …

Oct 3, 2025

Anthropic Teams Up With Its Rivals to Keep AI From Hacking Everything

… Done not carefully, this could be a meaningfully accelerant for attackers.” Project Glasswing partners, including some of Anthropic's competitors, struck a collaborative tone in statements as part of the launch. “Google is pleased to see this cross-industry cybersecurity initiative coming together,… …

Apr 7, 2026 · Lily Hay Newman

Cheap Chinese models are overtaking Anthropic

Anthropic struggling with Chinese competition, its own safety obsession. The maker of Claude faces headwinds as it rushes to go public. Anthropic, riding a wave of goodwill after resisting demands from the US Defense Department to soften model safeguards, is reportedly planning to go public a… …

Mar 28, 2026 · Thomas Claburn

OpenAI launches Daybreak: GPT-5.5 cybersecurity platform to find software vulnerabilities

… The platform aims to embed cyber defense into software from the beginning of the development process, not just as an afterthought. Daybreak builds on the success of OpenAI's earlier GPT-5.4-Cyber model, which helped remediate over 3,000 security flaws. …

May 12, 2026 · Jak Connor

World leaders want American AI. They just don't want America to be able to turn it off. | TechCrunch


Jun 17, 2026 · Rebecca Bellan

ChatGPT's market share slips below 50% for first time | TechCrunch


Jun 16, 2026 · Ivan Mehta

OpenAI's Sam Altman Meets With Trump in Wake of Executive Order on AI

… Altman also met with Republican and Democratic members of Congress, including House Speaker Mike Johnson, who told CNBC they had a "very good, productive meeting." He said the two discussed recent developments in AI and what the "light touch" framework for regulation will be to "prevent some of the… …

Jun 5, 2026
