Search

Showing top 47 results for "AI model rollout"

Paper page - Rubric-based On-policy Distillation

…AI-generated summary On-policy distillation (OPD) is a powerful paradigm for model alignment , yet its reliance on teacher logits restricts its application to white-box scenarios. We contend that structured semantic…

May 11, 2026

Paper page - Nonsense Helps: Prompt Space Perturbation Broadens Reasoning Exploration

…AI-generated summary Reinforcement learning with verifiable rewards, particularly Group Relative Policy Optimization (GRPO), has significantly advanced the reasoning capabilities of Large Language Models (LLMs). However, in complex tasks, GRPO frequently suffers…

May 8, 2026

Paper page - Causal Forcing++: Scalable Few-Step Autoregressive Diffusion Distillation for Real-Time Interactive Video Generation

…AI-generated summary Real-time interactive video generation requires low-latency, streaming, and controllable rollout. Existing autoregressive (AR) diffusion distillation methods have achieved strong results in the chunk-wise 4-step regime…

May 15, 2026

To show you the most relevant results, we’ve omitted some entries very similar to those already shown. Repeat the search with the omitted results included.

Followed topics

Search

Paper page - Rubric-based On-policy Distillation

Paper page - Nonsense Helps: Prompt Space Perturbation Broadens Reasoning Exploration

Paper page - Causal Forcing++: Scalable Few-Step Autoregressive Diffusion Distillation for Real-Time Interactive Video Generation

Paper page - Memory-Bound but Not Bandwidth-Limited: The Physical AI Inference Gap in Batch-1 LLM Decode

Paper page - OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents

Paper page - Orchard: An Open-Source Agentic Modeling Framework

Paper page - One Turn Too Late: Response-Aware Defense Against Hidden Malicious Intent in Multi-Turn Dialogue