Compliance versus Sensibility: On the Reasoning Controllability in Large Language Models
…The following papers were recommended by the Semantic Scholar API:
Deliberative Alignment is Deep, but Uncertainty Remains: Inference time safety improvement in reasoning via attribution of unsafe behavior to base model (2026…