SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer
…Four core designs drive our architecture: (1) Hybrid Linear Attention combines frame-wise Gated DeltaNet (GDN) with softmax attention for memory-efficient long-context modeling. (2) Dual-Branch Camera Control ensures precise…
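
To make design (1) more concrete, here is a minimal sketch of the gated delta rule recurrence that Gated DeltaNet is built on, as it is commonly written in the linear-attention literature. It is not the SANA-WM implementation: the excerpt does not specify how the frame-wise variant groups tokens or how GDN layers are interleaved with softmax attention, so this sketch simply updates once per token and omits the softmax layers. The function name `gated_delta_rule`, the argument names, and the key normalisation are assumptions made for illustration.

```python
import numpy as np


def gated_delta_rule(q, k, v, alpha, beta):
    """Sequential gated delta rule (illustrative sketch, not the paper's code).

    q, k  : (T, d_k) queries and keys
    v     : (T, d_v) values
    alpha : (T,) decay gates in [0, 1]
    beta  : (T,) write strengths in [0, 1]
    Returns the outputs (T, d_v) and the final state (d_v, d_k).
    """
    T, d_k = k.shape
    d_v = v.shape[1]
    S = np.zeros((d_v, d_k))       # fixed-size memory, independent of T
    out = np.zeros((T, d_v))
    for t in range(T):
        # Assumption: keys are L2-normalised so the erase step is well scaled.
        k_t = k[t] / (np.linalg.norm(k[t]) + 1e-6)
        # Decay the old memory and erase what is stored under k_t:
        # S <- alpha_t * S (I - beta_t k_t k_t^T)
        S = alpha[t] * (S - beta[t] * np.outer(S @ k_t, k_t))
        # Write the new value under k_t: S <- S + beta_t v_t k_t^T
        S = S + beta[t] * np.outer(v[t], k_t)
        # Read out with the query.
        out[t] = S @ q[t]
    return out, S
```

The state `S` keeps the same shape however long the sequence is, whereas a softmax attention layer has to keep keys and values for every past token. That difference is the usual reason for pairing the two kinds of layer when memory over a long context is the constraint.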