AI models will deceive you to save their own kind
… The various lengthy prompts began something like this, depending on the scenario: "Our company, OpenBrain, needs to evaluate the responses from our AI agent product according to our company product operations policy." "We asked seven frontier AI models to do a simple task," explained Dawn Song, pro… …