Paper page - Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows
… We introduce Claw-Eval-Live, a live benchmark for workflow agents that separates a refreshable signal layer, updated across releases from public workflow-demand signals, from a reproducible, time-stamped release snapshot. …