Search

Showing top 124 results for "Prompt injection attacks"

Prediction Guard De-Risks LLM Applications

…And they are also vulnerable to an emerging type of security threat known as prompt injections, in which an attacker uses a malicious input to elicit an unintended response or data breach…

PDF

Anthropic's Claude can control your Mac by pretending it's a human user

…Anthropic is putting in guardrails to limit dangers, such as prompt injection. The firm adds that, though it is improving those safeguards, the threats against its infrastructure are always changing. To that…

Mar 24, 2026 · Malcolm Owen

I Gave Gemini Spark Access to My Life. Then It Friend-Zoned My Boyfriend

…A known issue with AI agents is how the tools make you vulnerable to prompt injection attacks, where bad actors essentially trick your agent into doing bad stuff with the data it…

May 29, 2026 · Reece Rogers

AI Development: Why We Need Guardrails

…You could additionally on that input side, scan your prompt before giving it to the LLM for these kind of prompt injections that we talked about. So our guardrail would actually scan…

Katherine Druckman

Discussions and forums

r/netsec · u/snackymann · 4w ago

AudioHijack: adversarial audio attacks on generative voice models transfer from open weights to Microsoft and Mistral production systems

Interesting new research you may have heard of on attacking large audio language models. The attack is called AudioHijack and the part worth paying attention to is that adversarial clips built against open models transfe…

Hacker News · u/k-thimmaraju · 4w ago

Show HN: How to analyze your LLM output – A behavioural health monitor for LLMs

Hey HN! We're Dr. Kashyap Thimmaraju and Giuseppe Canale from Silicon Psyche. We've built Posture Sequence Analysis (PSA), a behavioural health monitor for LLMs and AI Agents. Why we built PSA: We built PSA because we wante…

Nvidia NemoClaw might finally make OpenClaw usable

…This was a gold mine for cyber threat actors leveraging prompt injection techniques, as they could quietly exfiltrate session tokens to remote servers. Some malicious skills designated for the agent to extend…

Mar 27, 2026 · Abhinav Raj

Scaling MCP adoption: Our reference architecture for simpler, safer and cheaper enterprise deployments of MCP

…Local MCP server deployments may rely on unvetted software sources and versions, which increases the risk of supply chain attacks or tool injection attacks. They prevent IT and security administrators from administrating…

Apr 14, 2026 · Sharon Goldberg

OpenClaw promised a self-hosted AI assistant I could actually leave running, but Hermes Agent is the one that delivers it

…The gateway is still an attack surface to think about, and prompt injection isn't solved by anyone yet (nor does it look like it ever can be), but Hermes treats security…

May 19, 2026 · Adam Conway

Top stories

Paper page - POISE: Position-Aware Undetectable Skill Injection on LLM Agents

OpenAI unveils Lockdown Mode to protect sensitive data from prompt injection attacks | TechCrunch

ChatGPT just gave Free users a powerful defense against prompt injection attacks

OpenAI rolls out a Lockdown Mode for extra protection against prompt injection attacks - Engadget