Exploring the Limits of KV Cache Compression in Visual Autoregressive Transformers

Open in new window