Paper page - Forcing-KV: Hybrid KV Cache Compression for Efficient Autoregressive Video Diffusion Models
…complexity and memory overhead from redundant key-value caches, which are addressed through a hybrid compression strategy that separates attention heads into static and dynamic categories for optimized caching. AI-generated summary…