Paper page - Beyond the Last Layer: Multi-Layer Representation Fusion for Visual Tokenization
…Xuanyu Zhu , , Yang Shi , , , , Abstract DRoRAE enhances visual representation by fusing multi-layer features from pretrained vision encoders through adaptive routing and incremental correction, improving reconstruction and generation quality. AI-generated summary…