Showing top 102 results for "AI agents and safety"

Anthropic blames dystopian sci-fi for training AI models to act “evil”

…The problem, the researchers theorize, is that this kind of RLHF safety training couldn’t possibly cover every single type of ethically difficult situation an agentic AI might encounter. When a modern…

May 13, 2026 · Kyle Orland

Hackers are learning to exploit chatbot ‘personalities’

…soon be used to break the AI agents coexisting with us in the real world — booking meetings, managing calendars, ordering food, handling customer service — and safety teams will need to ensure models…

May 24, 2026 · Robert Hart

Introducing Sonnet 4.6

…Our safety researchers concluded that Sonnet 4.6 has “a broadly warm, honest, prosocial, and at times funny character, very strong safety behaviors, and no signs of major concerns around high-stakes…

Feb 17, 2026

Microsoft, Nvidia claim AI speeds approval of nuclear plants

…by gutting the safety rules and skipping full environmental reviews for new reactors. AI, we're told, is expected to help by making highly complex work repeatable and predictable, and slashing development…

Mar 25, 2026 · Dan Robinson

Taiwan’s Industry Titans Turbocharge World’s AI Infrastructure Buildout With NVIDIA

Semiconductor and electronics manufacturing leaders are using NVIDIA AI to speed manufacturing from fabs to factory floors as they ramp up the production of NVIDIA Vera Rubin NVL72 infrastructure for agentic AI…

Jun 1, 2026 · Timothy Costa

Netflix, Meta, IBM speakers discuss AI and their workdays

…More to the point, optimal AI results favor the well-fortified agent, according to speakers from IBM, Meta, and Netflix – among others – at the All Things AI conference in Durham, North Carolina…

Apr 4, 2026 · Joab Jackson

NVIDIA Blog

…5 Ways NVIDIA AI Is Protecting the Planet April 22, 2026 NVIDIA and Google Cloud Collaborate to Advance Agentic and Physical AI April 22, 2026 Blowing Off Steam: How Power-Flexible AI…

May 10, 2026

Discussions and forums

Hacker News · u/mosiddi · Jan 30, 2026

Show HN: Agent OS – Safety-first platform for building AI agents with VS Code

Hi HN, I built Agent OS because I was tired of the "orchestration tax" – writing the same safety checks, memory management, and tool-handling code in every AI agent project. What it does: - Visual policy edit…

Hacker News · u/lucarizzo1010 · 2w ago

Show HN: AgentShield – Stop AI agents from spending money unsupervised

I'm a recent grad from UMich and built AgentShield because agentic AI is moving fast but payment safety hasn't caught up. Agents are already being handed API keys, stablecoin wallets, and payment credentials - if one mis…

Hacker News · u/podlp · Apr 28, 2026