ThoughtFold: Folding Reasoning Chains via Introspective Preference Learning
…Ziyan Liu, …, Yuzhe Gu, …, Wenwei Zhang

Abstract

ThoughtFold addresses over-thinking in large reasoning models by using fine-grained preference learning to identify and eliminate redundant explorations in chain-of-thought reasoning…