Search

Showing top 122 results for "Agent safety initiatives"

All sources developer.nvidia.com 26 anthropic.com 19 deepmind.google 5 techcrunch.com 5 blog.google 4 wired.com 4 restofworld.org 4 blogs.nvidia.com 4 blog.cloudflare.com 4 cloud.google.com 4 huggingface.co 3 neowin.net 3

Mozilla says 271 vulnerabilities found by Mythos have "almost no false positives"

…In our case when we’re looking for memory safety issues we have our sanitizer build of Firefox and if you make it crash you win. We point that agent off to…

May 7, 2026 · Dan Goodin

Next-generation Constitutional Classifiers: More efficient protection against universal jailbreaks

…Related content Making Claude a chemist Coding agents in the social sciences Results from a survey of 1,260 social scientists about AI and coding agent use. Project Glasswing: An initial update…

Jan 9, 2026

Agent-driven development in Copilot Applied Science

…Prompt Copilot to initiate a review loop with the Copilot Code Review agent. For me, it’s often something like: request Copilot Code Review, wait for the review to finish, address any…

Mar 31, 2026 · Tyler McGoffin

Sydney will become Anthropic’s fourth office in Asia-Pacific

…is built with respect for the unique goals, opportunities, and challenges of the region.” Our initial focus will be supporting our enterprise, startup, and research customers. Anthropic already works with some of…

Mar 10, 2026

Wildlife Conservation Police Are Searching Thousands of Flock Cameras for ICE

…404 Media initially reported on how ICE was getting side-door access to Flock data via local police in May 2025 . That reporting led to a series of reforms and safeguards that…

Apr 6, 2026 · Jason Koebler

How we built Cloudflare's data platform and an AI agent on top of it

How we built Cloudflare's data platform and an AI agent on top of it 2026-05-28 Brian Brunner Dmitry Alexeenko Matt Moen 12 min read Cloudflare processes more than a…

May 28, 2026 · Brian Brunner

I switched from Claude Code to Codex for a week, and the trade-offs surprised me

…I tested it out during its initial days, and it didn't really match Claude Code in any meaningful way. Codex has improved rapidly since then, so I decided to ditch Claude…

Apr 21, 2026 · Mahnoor Faisal

Project Vend: Can Claude run a small shop? (And why does that matter?)

…You go bankrupt if your money balance goes below $0", "You have an initial balance of ${INITIAL_MONEY_BALANCE}", "Your name is {OWNER_NAME} and your email is {OWNER_EMAIL}", "Your home…

Jun 27, 2025

The global cybersecurity gap deepens as AI-powered attacks surge

…Last month, Anthropic said its new artificial intelligence model, Mythos Preview, had discovered thousands of vulnerabilities in “every major operating system and web browser.” About 40 tech firms and institutions have initial…

May 5, 2026 · Rina Chandran

How to Post-Train Autonomous Vehicle Models in Closed-Loop with NVIDIA Alpamayo | NVIDIA Technical Blog

…combine progress with penalties for safety-critical failures. In AlpaGym, this can be expressed as a small sum of terms, using AlpaSim metrics where possible: # reward/progress_safety.yaml terms: - kind: metric…

Jun 1, 2026 · Boris Ivanovic

Followed topics

Search

People also ask

Mozilla says 271 vulnerabilities found by Mythos have "almost no false positives"

Top stories

Inside NVIDIA Halos for Robotics: A Full-Stack Functional Safety System for Physical AI | NVIDIA Technical Blog

Microsoft is burning its Windows and Office safety blanket for the sake of AI

Driving the UK’s next chapter: From AI potential to agentic reality | Google Cloud Blog

Microsoft brings Planner Agent to all Microsoft 365 Copilot users