Loki: Low-rank Keys for Efficient Sparse Attention

Open in new window