Next-generation Constitutional Classifiers: More efficient protection against universal jailbreaks
…Nevertheless, no AI systems currently on the market have perfectly robust defenses. Last year, we described a new approach to defend against jailbreaks which we called “ Constitutional Classifiers :” safeguards that monitor model…