Paper page - VideoMLA: Low-Rank Latent KV Cache for Minute-Scale Autoregressive Video Diffusion
…Hidir Yesiltepe , , , , , , Abstract VideoMLA reduces memory usage in video diffusion models by replacing per-head keys and values with shared low-rank content and decoupled 3D-RoPE positional keys, maintaining quality while…