LEAD: Length-Efficient Adaptive and Dynamic Reasoning for Large Language Models
…LEAD dynamically calibrates the correctness-efficiency trade-off at each step using a Potential-Scaled Instability, directing optimization capacity to the most informative learning signal. Furthermore, it estimates an adaptive per-problem…