Anthropic says ‘evil’ portrayals of AI were responsible for Claude’s blackmail attempts | TechCrunch
… Anthropic later published research suggesting that models from other companies had similar issues with “agentic misalignment.” Apparently Anthropic has done more work around that behavior, claiming in a post on X , “We believe the original source of the behavior was internet text that portrays AI a… …