Discovering Cooperative Pipelines: Autoresearch for Sequential Social Dilemmas
…Observability-Driven Automatic Evolution of Coding-Agent Harnesses (2026)
Reward Hacking Benchmark: Measuring Exploits in LLM Agents with Tool Use (2026)
Continual Harness: Online Adaptation for Self-Improving Foundation Agents (2026)
RoboPhD…