Paper page - Recovering Policy-Induced Errors: Benchmarking and Trajectory Synthesis for Robust GUI Agents
…At the data level, RoTS is a scalable synthesis framework that creates 800k high-quality data via a tree-based pipeline that proactively discovers diverse error modes and synthesizes corresponding recovery steps…