Paper page - MaskAlign: Token-Subset Representation Alignment for Efficient Diffusion Training
…a lightweight pre-mask token mixing block that shares information across tokens before masking. View arXiv page View PDF Add to collection Community This comment has been hidden (marked as Resolved) This…