Paper page - SEIF: Self-Evolving Reinforcement Learning for Instruction Following
…Multi-Agent Self-Evolution for LLM Reasoning (2026)
Experience is the Best Teacher: Motivating Effective Exploration in Reinforcement Learning for LLMs (2026)
$\pi$-Play: Multi-Agent Self-Play via Privileged Self-Distillation…