Search: Community sharing

Paper page - UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors

…UniVidX formulates pixel-aligned tasks as conditional generation in a shared multimodal space, adapts to modality-specific distributions while preserving the backbone's native priors, and promotes cross-modal consistency during synthesis…

May 4, 2026

Paper page - DECO: Sparse Mixture-of-Experts with Dense-Comparable Performance on End-Side Devices

…DECO utilizes the differentiable and flexible ReLU-based routing enhanced by learnable expert-wise scaling , which adaptively balances the contributions of routed and shared experts. Furthermore, we introduce NormSiLU , an activation function…

May 12, 2026

Paper page - RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards

…In this work, we argue that rubrics should serve not merely as final-answer evaluators, but as the shared interface that structures policy execution , judge feedback , and agent memory . Based on this…

May 13, 2026

Paper page - DEMON: Diffusion Engine for Musical Orchestrated Noise

…Ryan Fosdick Abstract DEMON enables real-time diffusion model control as a musical instrument through specialized scheduling, shared state management, and optimized decoding techniques. Generated by Qwen/Qwen2.5-Coder-32B-Instruct…

Jun 1, 2026

Paper page - Covering Human Action Space for Computer Use: Data Synthesis and Benchmark

…Our analysis of failure cases from advanced models suggests a long-tail pattern in GUI operations , where a relatively small fraction of complex and diverse interactions accounts for a disproportionate share of…

May 14, 2026

Paper page - ReflectDrive-2: Reinforcement-Learning-Aligned Self-Editing for Discrete Diffusion Driving

…We also co-design an efficient reflective decoding stack for the decision--draft--reflect pipeline , combining shared-prefix KV reuse , Alternating Step Decode , and fused on-device unmasking . On NAVSIM , ReflectDrive-2…

May 8, 2026

Paper page - IndustryBench: Probing the Industrial Knowledge Boundaries of LLMs

…View arXiv page View PDF Project page GitHub 120 Add to collection Community We are excited to share IndustryBench , a new benchmark designed by the Multimodal and Industrial AI team at Alibaba…

May 13, 2026

Paper page - FineVerify: Scaling Test-Time Compute with Fine-Grained Self-Verification for Agentic Search

…Code and data are available at https://github.com/XuZhao0/fineverify View arXiv page View PDF GitHub 3 Add to collection Community Code and data are available at: https://github.com/XuZhao0…

Jun 2, 2026

Paper page - Make Each Token Count: Towards Improving Long-Context Performance with KV Cache Eviction

…Lightweight retention gates assign utility scores to cached KV entries, and a shared final scoring projection calibrates these scores across all layers and heads. This enables a single global eviction policy in…