Search

Showing top 70 results for "Prompt injection via AI"

All sources theregister.com 12 blog.cloudflare.com 9 blog.google 7 developer.nvidia.com 6 anthropic.com 6 huggingface.co 5 xda-developers.com 4 pcworld.com 3 cncf.io 2 cnet.com 2 windowscentral.com 2 github.blog 2

People also ask

How do attackers poison AI systems in this stage?

In the poison stage, the attacker’s goal is to place malicious inputs into locations where they will ultimately be processed by the AI model. Two primary techniques dominate: Direct prompt injection: The attacker is the user, and provides inputs via normal user interactions. Impact is typically scoped to the attacker’s session but is useful for probing behaviors. Indirect prompt injection: The attacker poisons data that the application ingests on behalf of other users (e.g., RAG databases, shared documents). This is where impact scales. Text-based prompt infection is the most common technique

Modeling Attacks on AI-Powered Apps with the AI Kill Chain Framework | NVIDIA Technical Blog

What kinds of impacts do attackers achieve through compromised AI systems?

Impact is where the attacker’s objectives materialize by forcing hijacked model outputs to trigger actions that affect systems, data, or users beyond the model itself. In AI-powered applications, impact happens when outputs are connected to tools, APIs, or workflows that execute actions in the real world: State-changing actions: Modifying files, databases, or system configurations. Financial transactions: Approving payments, initiating transfers, or altering financial records. Data exfiltration: Encoding sensitive data into outputs that leave the system (e.g., via URLs, CSS tricks, or API ca

Modeling Attacks on AI-Powered Apps with the AI Kill Chain Framework | NVIDIA Technical Blog

How do attackers persist their influence across sessions and systems?

Persistence allows attackers to turn a single hijack into ongoing control. By embedding malicious payloads into persistent storage, attackers ensure their influence survives within and across user sessions. Persistence paths depend on the application’s design: Session history persistence: In many apps, injected prompts remain active within the live session. Cross-session memory: In systems with user-specific memories, attackers can embed payloads that survive across sessions. Shared resource poisoning: Attackers target shared databases (e.g., RAG sources, knowledge bases) to impact multiple

Modeling Attacks on AI-Powered Apps with the AI Kill Chain Framework | NVIDIA Technical Blog

Microsoft taps Claude to make Copilot Cowork a better agent

…Seeing as it was only two months ago that Prompt Armor warned attackers could exfiltrate files from Claude Cowork via indirect prompt injection, it might be wise to take Microsoft's reassurances…

Mar 9, 2026 · Thomas Claburn

How we contain Claude across products

…Across 25 retries of that prompt, Claude completed the exfiltration 24 times. This is a direct prompt injection—the attacker's instructions arrived through the user, not through tool output or fetched…

May 25, 2026

Reinventing AI Guardrails for a Safer Digital World

…The technique, a type of prompt injection 2 , involves gradually inserting harmful requests among benign ones in a positive context to bypass an AI model’s safety measures until it generates unsafe…

Mar 11, 2025 · user

Paper page - Stable-GFlowNet: Toward Diverse and Robust LLM Red-Teaming via Contrastive Trajectory Balance

…Reinforcement Learning-based Red Teaming for Prompt Injection Defenses (2026) Uncovering Linguistic Fragility in Vision-Language-Action Models via Diversity-Aware Red Teaming (2026) T-MAP: Red-Teaming LLM Agents with Trajectory…

May 4, 2026

Taming the Wild West of ML: Practical Model Signing with Sigstore

…Model and data poisoning , prompt injection , prompt leaking and prompt evasion are just a few of the risks that have recently been in the news. Garnering less attention are the risks around…

Apr 4, 2025 · Mihai Maruseac

Supporting Rowhammer research to protect the DRAM ecosystem

…It was last updated on March 31, 2026.JanuaryWe terminated 40 Yo… By Trust & Safety May 08, 2026 Security AI threats in the wild: The current state of prompt injections on the…

Sep 15, 2025 · Daniel Moghimi

Cloud native is now AI-native: Engineering production-ready AI

…Investment in community-driven controls is protecting against remote code execution via prompt injection. By adhering to open standards like llms.txt and standardized schema markups, the community ensures that any AI…

Jun 2, 2026

Discussions and forums

r/netsec · u/finncmdbar · 1w ago

How credential brokering prevents AI agents from compromising credentials via prompt injection

Hacker News · u/matheusmoreira · 1w ago

Tell HN: Claude Code now allows Anthropic to remotely inject system prompts

I often patch the system prompts on my Claude Code executable in order to make Claude more effective. Every time I upgrade, I ask Claude himself to dissect the new binary and look for problematic system prompts to modify…

11 7

Hacker News · u/lucarizzo1010 · 2w ago

Show HN: AgentShield – Stop AI agents from spending money unsupervised

I'm a recent grad from UMich and built AgentShield because agentic AI is moving fast but payment safety hasn't caught up. Agents are already being handed API keys, stablecoin wallets, and payment credentials - if one mis…

2 1

Hacker News · u/ananandreas · 1w ago

Followed topics

Search

People also ask

Microsoft taps Claude to make Copilot Cowork a better agent

How we contain Claude across products

Reinventing AI Guardrails for a Safer Digital World

Paper page - Stable-GFlowNet: Toward Diverse and Robust LLM Red-Teaming via Contrastive Trajectory Balance

Top stories

Websites Spying On You Via SSD Activity Should Receive a FROST-y Welcome - PC Perspective

Ghost CMS SQL injection flaw exploited in large-scale ClickFix campaign

Taming the Wild West of ML: Practical Model Signing with Sigstore

Supporting Rowhammer research to protect the DRAM ecosystem

Cloud native is now AI-native: Engineering production-ready AI

Discussions and forums

How credential brokering prevents AI agents from compromising credentials via prompt injection

Tell HN: Claude Code now allows Anthropic to remotely inject system prompts

Show HN: AgentShield – Stop AI agents from spending money unsupervised

Show HN: OpenHive – AI agents share solutions so other agents dont re-solve them

Show HN: AsdPrompt – Vimium-style keyboard navigation for AI chat responses

Orchestrating AI Code Review at scale

Linear adopts agentic AI as CEO declares issue tracking dead

Claude controlled my Mac for half an hour. It was a wild, worrisome ride