Paper page - Synthetic Computers at Scale for Long-Horizon Productivity Simulation
…Scaling Evaluation of Long-Horizon Agents on Subjective Enterprise Tasks (2026) SWE-Next: Scalable Real-World Software Engineering Tasks for Agents (2026) A Subgoal-driven Framework for Improving Long-Horizon LLM Agents…
