Project Vend: Phase two
…We’ve been talking a lot on our Engineering Blog about how to set up AI agents for success, and much of it involves giving them the correct tools . Could we apply…
Among the most impactful changes we made was forcing Claudius to follow procedures. When a new product request came in, instead of just blurting out a low price and an over-optimistic delivery time like in phase one, we prompted Claudius to double-check these factors using its product research tools (these tools helped a lot as well). This tended to make the prices higher and the waits longer—but it had the benefit of being more realistic. One way of looking at this is that we rediscovered that bureaucracy matters. Although some might chafe against procedures and checklists, they exist for a r
Project Vend: Phase two…We’ve been talking a lot on our Engineering Blog about how to set up AI agents for success, and much of it involves giving them the correct tools . Could we apply…
…Last week we announced Project Glasswing , highlighting the risks—and benefits—of AI models for cybersecurity. We stated that we would keep Claude Mythos Preview’s release limited and test new cyber…