Paper page - AEM: Adaptive Entropy Modulation for Multi-Turn Agentic Reinforcement Learning
…AI-generated summary Reinforcement learning (RL) has substantially improved the ability of large language model (LLM) agents to interact with environments and solve multi-turn tasks. However, effective agentic RL remains challenging…