Paper page - Agent-ValueBench: A Comprehensive Benchmark for Evaluating Agent Values
… Together these results signal that the agent-alignment lever is shifting from classical model alignment and prompt steering toward harness alignment and skill steering.