Loki: Low-Rank Keys for Efficient Sparse Attention

Open in new window