The assistant axis: situating and stabilizing the character of large language models
…Research demonstration
In collaboration with Neuronpedia, our researchers are also providing a research demo, where you can view activations along the Assistant Axis while chatting with a standard model and an activation…