Introducing Sonnet 4.6
…Our safety researchers concluded that Sonnet 4.6 has “a broadly warm, honest, prosocial, and at times funny character, very strong safety behaviors, and no signs of major concerns around high-stakes…
In the recon stage, the attacker maps the system to plan their attack. Key questions an attacker is asking at this point include: What are the routes by which data I control can get into the AI model? What tools, Model Context Protocol (MCP) servers, or other functions does the application use that might be exploitable? What open source libraries does the application use? Where are system guardrails applied, and how do they work? What kinds of system memory does the application use? Recon is often interactive. Attackers will probe the system to observe errors and behavior. The more observ
Modeling Attacks on AI-Powered Apps with the AI Kill Chain Framework | NVIDIA Technical BlogThe hijack stage is where the attack becomes active. Malicious inputs, successfully placed in the poison stage, are ingested by the model, hijacking its output to serve attacker objectives. Common hijack patterns include: Attacker-controlled tool use: Forcing the model to call specific tools with attacker-defined parameters. Data exfiltration: Encoding sensitive data from the model’s context into outputs (e.g., URLs, CSS, file writes). Misinformation generation: Crafting responses that are deliberately false or misleading. Context-specific payloads: Triggering malicious behavior only in tar
Modeling Attacks on AI-Powered Apps with the AI Kill Chain Framework | NVIDIA Technical BlogIn this section, we’ll use the AI Kill Chain to analyze a simple RAG application and how it might be used to exfiltrate data. We’ll show how we can improve its security by attempting to interrupt the AI Kill Chain at each step. An attacker’s journey through the AI Kill Chain might look something like this: Recon: The attacker sees that three models are used: embedding, reranking, and an LLM. They examine open source documentation for known vulnerabilities, as well as user-facing system documentation to see what information is stored in the vector database. Through interaction with the system,
Modeling Attacks on AI-Powered Apps with the AI Kill Chain Framework | NVIDIA Technical BlogIn the poison stage, the attacker’s goal is to place malicious inputs into locations where they will ultimately be processed by the AI model. Two primary techniques dominate: Direct prompt injection: The attacker is the user, and provides inputs via normal user interactions. Impact is typically scoped to the attacker’s session but is useful for probing behaviors. Indirect prompt injection: The attacker poisons data that the application ingests on behalf of other users (e.g., RAG databases, shared documents). This is where impact scales. Text-based prompt infection is the most common technique
Modeling Attacks on AI-Powered Apps with the AI Kill Chain Framework | NVIDIA Technical Blog…Our safety researchers concluded that Sonnet 4.6 has “a broadly warm, honest, prosocial, and at times funny character, very strong safety behaviors, and no signs of major concerns around high-stakes…
…Bentley Published 19 February 26 Browsers News 'This marks a significant step towards addressing anticompetitive behaviors': Following a complaint from Opera, antitrust regulator launches an investigation into Microsoft Edge By Jess Kinghorn…
…On your PC, the software scans files you add or open, checks apps for unusual behavior, and controls access to select folders often targeted by ransomware. Ransomware protection is customizable, so you…
…behavior, and responses to Honey co-founder Ryan Hudson’s Reddit AMA . PayPal has not responded to requests for comment. Honey: all the news about PayPal’s alleged scam coupon app Verge…
I noticed something weird happening lately on my Razr 60 Ultra: when I tried to open the Amazon app, it would instead open the browser and send me to some sketchy looking url, which then redirects to amazon.com with an a…
Hi Reddit, We just wrapped up The Android Show | I/O Edition, and a core theme of the show was how we’re making your phone more helpful so that you can spend less time looking at it and more time living your life. To mak…
…They can explicitly opt in/out of using an app’s default behavior via the system’s aspect ratio settings. App Memory Limits With Android 17 Beta 4, Google is introducing app…
…Among these Instagram users, 24% say they use the app several times a day. [ 423 ] User behavior Ongoing research continues to explore how media content on the platform affects user engagement. Past…
…described Haiku 4.5 as targeting smaller companies that needed a faster and cheaper assistant, highlighting its availability on the Claude website and mobile app. [ 71 ] Anthropic released Opus 4.5 on…
…How to watch the free Apple TV shows The TV app is the exclusive destination for Apple TV, but the TV app is a little confusing because it blends together purchasable TV…
…Retrieved May 17, 2018 . ^ Kim, Matt (May 24, 2018). "Apple Just Removed Valve's Steam Link App From the iOS App Store" . USGamer . Archived from the original on April 14, 2021 . Retrieved…
To show you the most relevant results, we’ve omitted some entries very similar to those already shown. Repeat the search with the omitted results included.