Paper page - BraveGuard: From Open-World Threats to Safer Computer-Use Agents
…This shift creates safety risks that are difficult to detect from isolated prompts or final responses, because harm often emerges only through multi-step execution traces whose individual actions appear locally benign…