SALS: Sparse Attention in Latent Space for KV Cache Compression

Open in new window