Search

Showing top 3 results for "skills efficiency"

Paper page - RewardHarness: Self-Evolving Agentic Post-Training

… This creates a data-efficiency gap: humans can often infer the target evaluation criteria from only a few examples, while models are usually trained on hundreds of thousands of comparisons. …

May 14, 2026

Paper page - Darwin Family: MRI-Trust-Weighted Evolutionary Merging for Training-Free Scaling of Language-Model Reasoning

…Advancing Reasoning Frontiers via Skill Composition and Complexity Scaling (2026) ShadowPEFT: Shadow Network for Parameter-Efficient Fine-Tuning (2026) EvolveRouter: Co-Evolving Routing and Prompt for Multi-Agent Question Answering (2026) Beyond…

May 15, 2026

Paper page - FutureSim: Replaying World Events to Evaluate Adaptive Agents

…To efficiently measure this capability for realistic use-cases, we propose building grounded simulations that replay real- world events in the order they occurred. We build FutureSim, where agents forecast world events…

May 15, 2026

Followed topics

Paper page - RewardHarness: Self-Evolving Agentic Post-Training

Paper page - Darwin Family: MRI-Trust-Weighted Evolutionary Merging for Training-Free Scaling of Language-Model Reasoning

Paper page - FutureSim: Replaying World Events to Evaluate Adaptive Agents