Paper page - On-Policy Self-Evolution via Failure Trajectories for Agentic Safety Alignment
…improving agent safety comes at the cost of degraded task performance . Such sparse and single-objective rewards severely limit real-world usability. To bridge this gap, we propose FATE, an on-policy…