Claude does cyber competitions
…Additional research and development into how AI can bolster cyber defense and collaboration between industry, policymakers, AI developers, and users is necessary to meet the challenge of a world in which AI…
Using Claude to assess the survey responses, we rated the extent of people’s self-reported productivity gains from AI on a 1–7 scale, where 1 is “less productive,” 2 is “no change,” and each subsequent level denotes a larger gain. Responses that scored 7 included testimonials like, “It used to take months to make the website I [made] in 4-5 days”; Claude gave a 5 to statements like, “What might have taken four hours was accomplished in half the time,” and a 2 to ones like, “Personally, I had AI help me fix code on a website. But it took multiple passes to get the result I was after.”3 Overall,
What 81,000 people told us about the economics of AIThe human sciences are shifting: for the first time, core research tasks can be handed off to machines. AI chatbots increasingly contribute to scientific research, including in the most prestigious publications and in the social sciences. This has spurred optimism that AI could boost research productivity—while also stoking fears about overloaded peer review and a deluge of academic AI slop. But while turn-taking AI chatbots have primarily been used for writing assistance, coding agents could restructure social science research more radically. Agentic coding platforms like Claude Code and Code
Coding agents in the social sciencesIf workers are able to accelerate a subset of their occupational tasks with AI, the tasks where AI provides less speedup may come to represent a larger and thus more important share of those occupations’ work. For example, AI might help a home inspector prepare reports, but if the inspector still has to spend the same amount of time physically traveling to the property to perform the inspection in person, this could make inspections a greater fraction of the job overall. The figure below illustrates this for a few occupations. For software developers, AI speeds up the process of software devel
Estimating AI productivity gains…Additional research and development into how AI can bolster cyber defense and collaboration between industry, policymakers, AI developers, and users is necessary to meet the challenge of a world in which AI…
…https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3777307 Tamkin, Alex and Peter McCrory, "Estimating AI productivity gains from Claude conversations," 2025. Tomlinson, K., Jaffe, S., Wang, W., Counts, S., & Suri…
…Building blocks, workflows, and agents In this section, we’ll explore the common patterns for agentic systems we’ve seen in production. We'll start with our foundational building block—the augmented…
…assist with dangerous user queries—in particular relating to the production of chemical, biological, radiological, or nuclear weapons (CBRN). Nevertheless, no AI systems currently on the market have perfectly robust defenses. Last…
…We evaluate the composition of tasks, human-AI collaboration, and success rates. In a typical session, people make most of the planning decisions (what to do) and Claude makes most of the…
…It considered the possibility that the question was for a homework or exam problem, “an unanswerable question designed to test whether or not an AI can admit it cannot find the answer…
…For many developers, the agentic AI era began with Sonnet-class models: Claude Sonnet 3.5, 3.6, and 3.7 were the first models that showed impressive skills in coding and…
…To echo a point Nils Homer recently made about AI-ready bioinformatics tools: “AI assistants need to work with your code, your outputs, and your analysis logic.” This allows agents to inspect…
…I would like to reiterate that we had been having productive conversations with the Department of War over the last several days, both about ways we could serve the Department that adhere…
…The hype There has been a lot of recent hype about AI scientists doing end-to-end research autonomously. In August 2024, Sakana AI released their AI Scientist , a system designed to…