Search: agent safety focus

Researchers gaslit Claude into giving instructions to build explosives

… While Garraghan says other chatbots are equally vulnerable to the kind of social attack the researchers used on Claude, they focused on Anthropic given the company’s self-proclaimed attention to safety and strong performance in other red-teaming efforts, including a study testing whether chatbots w… …

May 5, 2026 · Robert Hart

That UL logo is more complicated than it looks

… We focus on products. We focus on product safety. …

Apr 27, 2026 · Nilay Patel

Ronan Farrow on Sam Altman’s “unconstrained” relationship with the truth

… The safety stakes are so acute that they have not gone away. This is the reason this company was founded as a nonprofit focused on safety, and where things were being obscured in a way that credible people around this found it less than professional. …

Apr 16, 2026 · Nilay Patel

Followed topics

Search

Researchers gaslit Claude into giving instructions to build explosives

That UL logo is more complicated than it looks

Ronan Farrow on Sam Altman’s “unconstrained” relationship with the truth

360-degree cameras have a new superpower

Uber employees have a Dara AI

How to win — and lose — Decoder