Anthropic's safety warnings may have just backfired — the government has pulled the plug on its most powerful AI | TechCrunch
… Anthropic’s broader argument is that its strongest safeguards operate through independent classifier systems that function separately from the model itself, meaning that even if someone convinces Fable to keep talking past a refusal, the underlying protections against the most dangerous outputs rem… …