Search

Showing top 121 results for "Agent safety research"

All sources developer.nvidia.com 24 anthropic.com 16 blogs.nvidia.com 15 xda-developers.com 10 huggingface.co 9 theregister.com 8 deepmind.google 6 github.blog 5 wired.com 3 blog.google 3 restofworld.org 3 techcrunch.com 2

Videos

Paper page - Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs

…research-level-math-capabilities-of-llms-6768-150f563f . how can you robustly separate ill-posedness from policy-driven refusal across models with different safety configurations? Get this paper in your agent: hf…

May 12, 2026

Paper page - ESARBench: A Benchmark for Agentic UAV Embodied Search and Rescue

…However, existing UAV SAR research is dominated by traditional vision and path-planning methods and lacks a comprehensive and unified benchmark for embodied agents . To bridge this gap, we first propose the…

May 6, 2026

Google DeepMind partners with global consultancies to accelerate enterprise AI adoption.

…research and development. Looking ahead These efforts build upon Google Cloud’s work supporting global consulting partners, systems integrators, software partners, and specialized services providers as they implement and scale agentic AI…

Apr 22, 2026 · David Thacker

Teaching Claude why

…Thus, after Claude 4, it was clear we needed to improve our safety training and, since then, we have made significant updates to our safety training. We use agentic misalignment as a…

May 8, 2026

Can AI Really Build Better AI?

…He worries about research so risky happening “outside the public eye.” Krueger, who founded an AI-safety nonprofit called Evitable , advocates for globally pausing AI development. “It’s gambling with everyone’s…

May 7, 2026 · Matthew Hutson

Inside OpenAI’s Race to Catch Up to Claude Code

…By December 2024, several small groups inside of OpenAI were starting to focus on AI coding agents. One of them was led by Mishchenko and Thibault Sottiaux, a former Google DeepMind researcher…

Mar 11, 2026 · Maxwell Zeff

Paper page - IndustryBench: Probing the Industrial Knowledge Boundaries of LLMs

…enough when safety is on the line. We invite the community to explore the dataset and see how current models handle strict industrial constraints! Get this paper in your agent: hf papers…

May 13, 2026

Google DeepMind & Singapore: National AI partnership

…We are partnering with the National Research Foundation to train local researchers on agentic AI for science tools like Hypothesis Generation built with Co-Scientist, which are already showing promise in a…

May 20, 2026 · Google DeepMind

Claude Opus 4.6

…We’ve introduced agent teams in Claude Code as a research preview. You can now spin up multiple agents that work in parallel as a team and coordinate autonomously—best for tasks…

Feb 5, 2026

Architecting Security for Agentic Capabilities in Chrome

…Collaborating across the community We have a long-standing commitment to working with the broader security research community to advance security together, and this includes agentic safety. We’ve updated our Vulnerability…

Dec 8, 2025 · Nathan Parker

Followed topics