Measuring AI agent autonomy in practice
… Product developers should design for user oversight. Effective oversight of agents requires more than putting a human in the approval chain. …
… Product developers should design for user oversight. Effective oversight of agents requires more than putting a human in the approval chain. …
… For example, how naval law treats abandoned ships has relevance to how the law might treat agents that run without human oversight. …
… Integrated users have the same appetite for regulation and oversight Integrated users were more trusting of every institution we asked about, including AI companies—and were markedly less inclined to say AI development should be slowed or stopped. …
… The main goal of scalable oversight is to get models to better understand and behave in accordance with human values. Another key feature of scalable oversight, especially techniques like CAI, is that they allow us to automate red-teaming aka adversarial training . …
… These usage data corroborate the survey data: engineers delegate increasingly complex work to Claude and Claude requires less oversight. …
… Appendix: The numbers Total Claude sessions 270 Messages exchanged 51,248 Input tokens ~27.5M Output tokens ~8.6M Draft versions 110 CPU hours for simulations ~40 Human oversight time ~50–60 hours Matthew Schwartz is a professor of physics at Harvard University. …
…We regularly run our models against roughly a thousand open source repositories from the OSS-Fuzz corpus , and grade the worst crash they can produce on a five-tier ladder of increasing…