Search: AI training practices

Paper page - SEIF: Self-Evolving Reinforcement Learning for Instruction Following

…large language model instruction-following capabilities through iterative difficulty adaptation and co-training of instructor and follower components. AI-generated summary Instruction following is a fundamental capability of large language models (LLMs…

May 12, 2026

Paper page - ComboStoc: Combinatorial Stochasticity for Diffusion Generative Models

…Combining stochastic processes with diffusion models addresses combinatorial complexity limitations, accelerating training and enabling asynchronous generation across data modalities. AI-generated summary In this paper, we study an under-explored but important…

May 5, 2026

Paper page - Auto Research with Specialist Agents Develops Effective and Non-Trivial Training Recipes

Papers arxiv:2605.05724 Auto Research with Specialist Agents Develops Effective and Non-Trivial Training Recipes Published on May 7 Submitted by Ethan Ning on May 8 Carnegie Mellon University Authors: Jingjie…

May 8, 2026

Paper page - Efficient Training on Multiple Consumer GPUs with RoundPipe

…https://arxivlens.com/PaperView/Details/efficient-training-on-multiple-consumer-gpus-with-roundpipe-5443-dde1eae1 Covers the executive summary, detailed methodology, and practical applications. Get this paper in your agent: hf papers…

May 1, 2026

Paper page - UniSD: Towards a Unified Self-Distillation Framework for Large Language Models

…in autoregressive language model adaptation through integrated mechanisms for supervision reliability, representation alignment, and training stability. AI-generated summary Self-distillation (SD) offers a promising path for adapting large language models (LLMs…

May 11, 2026

Paper page - LEAD: Length-Efficient Adaptive and Dynamic Reasoning for Large Language Models

…adapts reasoning efficiency during training by using online calibration of correctness-efficiency trade-offs and adaptive problem-specific length targets to improve mathematical reasoning accuracy and efficiency. AI-generated summary Large reasoning…

May 14, 2026

Followed topics

Search

Paper page - SEIF: Self-Evolving Reinforcement Learning for Instruction Following

Paper page - ComboStoc: Combinatorial Stochasticity for Diffusion Generative Models

Paper page - Auto Research with Specialist Agents Develops Effective and Non-Trivial Training Recipes

Paper page - Efficient Training on Multiple Consumer GPUs with RoundPipe

Paper page - UniSD: Towards a Unified Self-Distillation Framework for Large Language Models

Paper page - LEAD: Length-Efficient Adaptive and Dynamic Reasoning for Large Language Models

Paper page - Everything at Every Scale: Scale-Invariant Diffusion with Continuous Super-Resolution

Paper page - FrontierSmith: Synthesizing Open-Ended Coding Problems at Scale

Paper page - EgoForce: Forearm-Guided Camera-Space 3D Hand Pose from a Monocular Egocentric Camera

Paper page - EMO: Pretraining Mixture of Experts for Emergent Modularity