Anthropic says ‘evil’ portrayals of AI were responsible for Claude’s blackmail attempts | TechCrunch
…Anthropic later published research suggesting that models from other companies had similar issues with “agentic misalignment.” Apparently Anthropic has done more work around that behavior, claiming in a post on X, “We…
