Search

Showing top 124 results for "agentic coding"

Paper page - SEIF: Self-Evolving Reinforcement Learning for Instruction Following

…Multi-Agent Self-Evolution for LLM Reasoning (2026) Experience is the Best Teacher: Motivating Effective Exploration in Reinforcement Learning for LLMs (2026) $\pi$-Play: Multi-Agent Self-Play via Privileged Self-Distillation…

May 12, 2026

Paper page - BEACON: A Multimodal Dataset for Learning Behavioral Fingerprints from Gameplay Data

…The authors release the dataset and code on Hugging Face and GitHub to create a reproducible benchmark for evaluating next-generation behavioral fingerprinting and security models Get this paper in your agent…

May 14, 2026

Paper page - Praxy Voice: Voice-Prompt Recovery + BUPS for Commercial-Class Indic TTS from a Frozen Non-Indic Base at Zero Commercial-Training-Data Cost

…For intra-sentential code-mix we add a third branch (IndicF5 + native-script transliteration) that drops code-mix LLM-WER from 0.80-0.85 to 0.14-0.27 across Hi…

Apr 30, 2026

Paper page - EMO: Pretraining Mixture of Experts for Emergent Modularity

…AI-generated summary Large language models are typically deployed as monolithic systems, requiring the full model even when applications need only a narrow subset of capabilities, e.g., code, math, or domain…

May 8, 2026

To show you the most relevant results, we’ve omitted some entries very similar to those already shown. Repeat the search with the omitted results included.

Followed topics

Paper page - SEIF: Self-Evolving Reinforcement Learning for Instruction Following

Paper page - BEACON: A Multimodal Dataset for Learning Behavioral Fingerprints from Gameplay Data

Paper page - Praxy Voice: Voice-Prompt Recovery + BUPS for Commercial-Class Indic TTS from a Frozen Non-Indic Base at Zero Commercial-Training-Data Cost

Paper page - EMO: Pretraining Mixture of Experts for Emergent Modularity