Paper page - MASCing: Configurable Mixture-of-Experts Behavior via Activation Steering Masks
…yielding a difficult-to-control mechanism that can vary across safety-relevant scenarios. At the same time, adapting model behavior through full fine-tuning or retraining is costly, especially when developers need…