Paper page - SAAS: Self-Aware Reinforcement Learning for Over-Search Mitigation in Agentic Search
…SAAS introduces three key components: (i) a search boundary modeling mechanism, which identifies the search boundary under the evolving policy by contrasting search-disabled and search-enabled rollouts; (ii) a boundary-aware…