Project Glasswing: what Mythos showed us
…That one thing might be a single complex feature, transitions across security boundaries, or a specific vulnerability class like command injections, where attacker input ends up being run as a shell command…
…That one thing might be a single complex feature, transitions across security boundaries, or a specific vulnerability class like command injections, where attacker input ends up being run as a shell command…
…This lets you inject secrets into requests outside the sandbox, so the agent never has access to them. This protects against exfiltration attacks. And sometimes internal services shouldn’t ever be exposed…
…And finally, we are collaborating with our research counterparts to explore solutions to potential exploits such as prompt injection in content and timing bypass. POSTED IN:
…can’t be retained by third-party API providers, made vulnerable to model inversion attacks, or injected into agentic pipelines. You need air-gapped inference for tier-one sensitive workloads, differential privacy…
Interesting new research you may have heard of on attacking large audio language models. The attack is called AudioHijack and the part worth paying attention to is that adversarial clips built against open models transfe…
Hey HN! We're Dr. Kashyap Thimmaraju and Giuseppe Canale from Silicon Psyche. We've built Posture Sequence Analysis (PSA), a behavioural health monitor for LLMs and AI Agents.Why we built PSAWe built PSA because we wante…
…This creates a large privacy attack surface: plaintext prompts and logs may contain PII , medical/financial data, credentials cloud memory stores can leak via retrieval, prompt injection, inversion, or misconfiguration naïve mitigation…
…Automatically discover and label every LLM endpoint exposed to the internet, providing immediate visibility into your AI attack surface. Request validation : Prevent "AI-jacking" by blocking prompt injections and malicious inputs designed…
…of “sudo,” the Linux “superuser” command. OpenClaw is also worryingly vulnerable to “prompt injection” attacks, which aim to trick an LLM into ignoring its guardrails and do things like leak…
…in the same May 11 TanStack attack, that some credential material was exposed, and that signing keys for Windows, macOS, iOS, and Android were impacted, prompting it to re-sign its apps…
…Finally, the model also shows significant improvement in agentic safety, meaning it's a lot better at recognizing and refusing prompt injection attacks when you're using it as an agent. Opus…
…Team agentic workflows and find points of exploitability and vulnerabilities like prompt injection, jail break, tool poisoning, and other custom attacks. Visualize the results on a dashboard and analyze risks. Apply pluggable…