Paper page - Length Value Model: Scalable Value Pretraining for Token-Level Length Modeling
…AI-generated summary Token serves as the fundamental unit of computation in modern autoregressive models , and generation length directly influences both inference cost and reasoning performance. Despite its importance, existing approaches lack…