Paper page - On-Policy Self-Evolution via Failure Trajectories for Agentic Safety Alignment
…Such sparse and single-objective rewards severely limit real-world usability. To bridge this gap, we propose FATE, an on-policy self-evolving framework that transforms verifier-scored failures into repair supervision…