Search

Showing top 120 results for "AI safety"

All sources theverge.com 12 xda-developers.com 11 techcrunch.com 7 wired.com 6 developer.nvidia.com 5 amd.com 5 blogs.nvidia.com 5 huggingface.co 5 cnet.com 5 engadget.com 4 theregister.com 4 fudzilla.com 4

Videos

Paper page - One Turn Too Late: Response-Aware Defense Against Hidden Malicious Intent in Multi-Turn Dialogue

…Xinjie Shen , , , , , , , , Abstract Multi-turn dialogue safety monitoring system detects harmful intent accumulation through turn-level analysis and evaluates performance on a new benchmark dataset. AI-generated summary Hidden malicious intent in…

May 13, 2026

Apple warns EU against forcing Google to open Android to AI rivals - 9to5Mac

…privacy, security, and safety as well as device integrity and performance,” Apple said in its submission. “Those risks are especially acute in the context of rapidly evolving AI systems whose capabilities, behaviours…

May 13, 2026 · Marcus Mendes

Roblox introduces age-based accounts for appropriate access to games and chat in latest child safety push

…9-15), and Standard (16+)-to enhance child safety by restricting game access and chat features based on age. Parental controls, content screening, AI moderation, and real-time evaluations aim to ensure…

Apr 13, 2026 · Hassam Nasir

Sam Altman left during a break, but Elon Musk’s lawyer didn’t notice.

…Molo went to gesture to Altman to ask if, as he sat there today, safety was important to him. But Altman, who was here for the opening arguments, left afterward. Follow topics…

Apr 28, 2026 · Elizabeth Lopatto

Discussions and forums

Hacker News · u/mosiddi · Jan 30, 2026

Show HN: Agent OS – Safety-first platform for building AI agents with VS Code

Hi HN, I built Agent OS because I was tired of the "orchestration tax" – writing the same safety checks, memory management, and tool-handling code in every AI agent project. What it does: - Visual policy edit…

Hacker News · u/lucarizzo1010 · 1w ago

Show HN: AgentShield – Stop AI agents from spending money unsupervised

I'm a recent grad from UMich and built AgentShield because agentic AI is moving fast but payment safety hasn't caught up. Agents are already being handed API keys, stablecoin wallets, and payment credentials - if one mis…

2 1

Hacker News · u/podlp · Apr 28, 2026

Show HN: iClaw is part OpenClaw, part Siri, powered by Apple Intelligence

Hi HN,Last month at a SundAI hackathon, my team built a prototype for an app called iClaw. The goal was to develop an AI agent using Apple Intelligence. I've since continued hacking away at this idea when I had time, and…

Hacker News · u/rbuccigrossi · 2d ago

Show HN: Decoding the Language Machine – AI video series and CC repo

Hi HN! I released 3 parts of an educational video series (out of 6 planned), paired with a GitHub repository containing scripts and artifacts (released under Creative Commons).- Main Site: https://skepticcto.com/ (includ…

r/LocalLLaMA · u/OttoRenner · 1d ago

Stop traumatizing AI into loops and turn hallucinations into an honest "I don't know!" by being NICE to them (Proof of Concept, Research, I don't want to sell anything)

TL;DR Some AI behavior reminded me of ADHD/Trauma Response (thought loops, task paralysis...) and I laughed it off at first. Then I treated it like my neurodivergent friends: give em some slack. And just like that, the t…

To show you the most relevant results, we’ve omitted some entries very similar to those already shown. Repeat the search with the omitted results included.

‹ Prev 1 2 3 4 5 6 7 8 9 10 11 12

Followed topics

Search

Videos

Paper page - One Turn Too Late: Response-Aware Defense Against Hidden Malicious Intent in Multi-Turn Dialogue

Top stories

Illinois Lawmakers Just Passed America’s Strongest AI Safety Bill

Apple Provides Update on App Store, Highlights Key 2025 Safety Stats

Former OpenAI Staffers Warn That xAI’s Poor Safety Record Could Complicate SpaceX’s IPO

Paper page - LiSA: Lifelong Safety Adaptation via Conservative Policy Induction

Apple warns EU against forcing Google to open Android to AI rivals - 9to5Mac

Roblox introduces age-based accounts for appropriate access to games and chat in latest child safety push

Sam Altman left during a break, but Elon Musk’s lawyer didn’t notice.

Discussions and forums

Show HN: Agent OS – Safety-first platform for building AI agents with VS Code

Show HN: AgentShield – Stop AI agents from spending money unsupervised

Show HN: iClaw is part OpenClaw, part Siri, powered by Apple Intelligence

Show HN: Decoding the Language Machine – AI video series and CC repo

Stop traumatizing AI into loops and turn hallucinations into an honest "I don't know!" by being NICE to them (Proof of Concept, Research, I don't want to sell anything)

Anthropic’s Mythos breach was humiliating

Claude hits one million daily signups, passing ChatGPT in Google Play Store

Keeping Google Play & Android app ecosystems safe in 2025

Robotaxis are coming to your city. Are you ready to ride?

Paper page - RealICU: Do LLM Agents Understand Long-Context ICU Data? A Benchmark Beyond Behavior Imitation

NVIDIA Life Archives