Search: data/control

Paper page - Guiding LLM Post-training Data Engineering with Model Internals from Sparse Autoencoders

…Yi Jing , , , , , , Xiaozhi Wang Abstract SAERL uses Sparse Autoencoder-derived signals from model internals to enhance LLM reinforcement learning through diversity control, difficulty-aware curriculum learning, and quality-based data filtering. AI…

May 28, 2026

Paper page - Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning

Papers arxiv:2605.02913 Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning Published on Apr 8 Submitted by Rohan Surana on May 6 McAuley-Lab Authors…

May 6, 2026

Paper page - Debiased Model-based Representations for Sample-efficient Continuous Control

Papers arxiv:2605.11711 Debiased Model-based Representations for Sample-efficient Continuous Control Published on May 12 Submitted by Jiafei Lyu on May 13 Tencent Hunyuan Authors: , , , Kai Yang , , , , Abstract DR.Q…

May 13, 2026

Paper page - ExoActor: Exocentric Video Generation as Generalizable Interactive Humanoid Control

…No dataset linking this paper Cite arxiv.org/abs/2604.27711 in a dataset README.md to link it from this page. No Space linking this paper Cite arxiv.org/abs/2604…

May 1, 2026

Paper page - MolmoAct2: Action Reasoning Models for Real-world Deployment

…datasets, open-weight action tokenizers, architectural redesign for continuous-action prediction, and adaptive reasoning for reduced latency. AI-generated summary Vision-Language-Action (VLA) models aim to provide a single generalist controller…

May 5, 2026

Paper page - PhyCo: Learning Controllable Physical Priors for Generative Motion

…Sriram Narayanan , , , Abstract PhyCo enhances video diffusion models with physics-based control through a large-scale dataset, physics-supervised fine-tuning, and vision-language model guidance for improved physical consistency. AI-generated…

May 2, 2026

Paper page - Urban-ImageNet: A Large-Scale Multi-Modal Dataset and Evaluation Framework for Urban Space Perception

…Instance segmentation Balanced 1K / 10K / 100K subsets support controlled scaling-behavior studies, alongside a full 2M-scale corpus for large-scale training. Dataset and code are publicly available on Hugging Face and…

May 13, 2026

Paper page - Diffusion Templates: A Unified Plugin Framework for Controllable Diffusion

…broad range of controllable generation tasks while preserving modularity, composability , and practical extensibility across rapidly evolving diffusion backbones . All resources will be open sourced, including code, models, and datasets. View arXiv page…

Apr 30, 2026

Paper page - LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling

…the discovery environment must make the control space tractable and provide cheap, frequent feedback for TTS search. As a concrete instantiation, we formulate width--depth TTS as controller synthesis over pre-collected…

May 11, 2026

Paper page - TD3B: Transition-Directed Discrete Diffusion for Allosteric Binder Generation

…View arXiv page View PDF Add to collection Community No dataset linking this paper Cite arxiv.org/abs/2605.09810 in a dataset README.md to link it from this page. No…

May 12, 2026

Followed topics