Search

Showing top 129 results for "AI safety safeguards"

All sources anthropic.com 13 xda-developers.com 13 engadget.com 9 wired.com 8 theverge.com 6 techcrunch.com 6 cnet.com 6 tomsguide.com 5 blog.google 5 theregister.com 4 fudzilla.com 4 arstechnica.com 4

Nanny state vs. Linux: show us your ID, kid

OSes Nanny state discovers Linux, demands it check kids' IDs before booting Age-verification laws target operating systems because apparently teenagers having root access is now a safeguarding crisis OPINION A new…

Mar 13, 2026 · Steven J. Vaughan-Nichols

App alerts users to nearby camera-equipped smart glasses

…A new app just released on the App Store is the perfect example of safeguards should be implemented when Apple launches its smart glasses. Meta Ray-Ban Display. Image source: Meta The…

Mar 26, 2026 · Amber Neely

2028: Two scenarios for global AI leadership

…Opportunities for engagement on AI safety Anthropic supports international AI safety dialogue with AI experts in China, when possible. The world has a vested interest in safe AI, regardless of where it…

May 14, 2026

Trustworthy agents in practice

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

Apr 9, 2026

Discussions and forums

Hacker News · u/netfortius · 4w ago

"An (important) message from Infomaniak's founder"

Hello ...,I'm writing to you as the founder and strategic director of Infomaniak because something important has just happened, and it concerns you directly.I no longer control InfomaniakIt's not a multinational that has…

7 1

Everything announced at Apple's WWDC 2026 keynote - Engadget

Everything announced at Apple's WWDC 2026 keynote Siri AI is finally almost here and Liquid Glass will look a little less liquid soon. By Kris Holt Updated: June 8, 2026 3…

Jun 8, 2026 · Kris Holt

Introducing Claude Opus 4.7

…the risks—and benefits—of AI models for cybersecurity. We stated that we would keep Claude Mythos Preview’s release limited and test new cyber safeguards on less capable models first. Opus…

Apr 16, 2026

Vibe Coding: Where it works and why doing it on an iPhone is a problem

Vibe coding is great for the App Store economy, but Apple is still wary about its use without safeguards in place. It's a fine balance that's going to…

May 19, 2026 · Malcolm Owen

Mapping AI-enabled cyber threats: Insights from the LLM ATT&CK Navigator

…It is calculated based on the actor's activity across Claude.ai, Claude Code, and our API, drawing on our safety classifiers alongside open-source and internal threat-intelligence indicators. The higher…

Jun 3, 2026

Measuring AI agent autonomy in practice

…Training models to recognize and act on their own uncertainty is an important safety property that complements external safeguards like permission systems and human oversight. At Anthropic, we train Claude to ask…

Feb 18, 2026

Anthropic co-founder Chris Olah's remarks on Pope Leo XIV's encyclical "Magnifica humanitas"

…May 25, 2026, Pope Leo XIV released an encyclical on the topic of AI: "Magnifica humanitas: On safeguarding the human person in the time of artificial Intelligence." Anthropic co-founder Chris Olah…

May 25, 2026

Followed topics

Search

People also ask