Paper page - Do not copy and paste! Rewriting strategies for code retrieval
… AI-generated summary Embedding-based code retrieval often suffers when encoders overfit to surface syntax. …
… StraTA samples a compact strategy from the initial task state, conditions subsequent actions on that strategy, and trains strategy generation and action execution jointly with a hierarchical GRPO-style rollout design, further enhanced by diverse strategy rollout and critical self-judgment. …
… The following papers were recommended by the Semantic Scholar API PRAISE: Prefix-Based Rollout Reuse in Agentic Search Training 2026 Train at Moving Edge: Online-Verified Prompt Selection for Efficient RL Training of Large Reasoning Model 2026 Accelerating RL Post-Training Rollouts via System-Integ… …
Papers arxiv:2605.08083 LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling Published on May 8 Submitted by Chengsong Huang on May 11 3 Paper of the day Google Authors: Tong Zheng, …, Runpeng Dai, …, Tianyi Xiong, … Abstract AutoTTS automates test-time scaling strategy discove… …
… AI-generated summary The Model Context Protocol (MCP) has unified the interface between Large Language Models (LLMs) and external tools, yet a fundamental gap remains in how agents conceptualize the environments within which they operate. …
… In this work, we systematically study MoE compression in large-scale pretraining, focusing on three key questions: whether pruning provides a better initialization than training from scratch, how expert compression choices affect the final model after continued training, and which training strateg… …
… Interesting breakdown of this paper on arXivLens: https://arxivlens.com/PaperView/Details/tmas-scaling-test-time-compute-via-multi-agent-synergy-682-9aa2c46a Covers the executive summary, detailed methodology, and practical applications. …
… Moreover, we scale training to billions of images and incorporate a synthetic rendering engine to improve performance in text-rich scenarios. …
… This methodology enhances the feed-forward paradigm by integrating novel geometric constraints with a perspective-view training strategy, explicitly countering the primary sources of geometric error. This geometry-centric strategy yields a dramatic leap in both 3D accuracy and photorealism. …
… AI-generated summary Existing emotional support conversation (ESC) systems mainly rely on end-to-end response generation or coarse strategy supervision, offering limited interpretability and little support for systematic skill improvement. …