A Causal Language Modeling Detour Improves Encoder Continued Pretraining
arXiv:2605.12438
Published on May 12
Submitted by Rian Touchent on May 13
ALMAnaCH Inria
Authors: Rian Touchent,
Abstract: Switching from Masked Language Modeling to Causal Language Modeling during encoder adaptation im… …