Paper page - When to Think, When to Speak: Learning Disclosure Policies for LLM Reasoning
…Jiaqi Wei , , , , , , Qingyun Wang , Abstract Side-by-Side Interleaved Reasoning enables controlled disclosure timing in autoregressive models, improving accuracy and efficiency through interleaved private reasoning and delayed content release. AI-generated summary…