New Microsoft tool lets devs spin up AI behavior tests using text descriptions | TechCrunch
… The framework, according to Microsoft, fills a gap that broader, more general evaluations cannot when AI models are intended to behave in a manner that is shaped by an application or product’s context, policies, and tools. “One of the things we’ve learned is that evaluations are absolutely critical… …