SALS: Sparse Attention in Latent Space for KV cache Compression

Open in new window