Paper page - ToolCUA: Towards Optimal GUI-Tool Path Orchestration for Computer Use Agents
… Finally, we optimize ToolCUA with Online Agentic RL in a high-fidelity GUI-Tool environment, guided by a Tool-Efficient Path Reward that encourages appropriate tool use and shorter execution paths. …