Paper page - Beyond Reasoning: Reinforcement Learning Unlocks Parametric Knowledge in LLMs
…learning improves large language model recall of parametric knowledge by redistributing probability mass toward correct answers, with gains driven primarily by reinforcing rare but learnable examples. AI-generated summary Reinforcement learning (RL…