SeerAttention: Learning Intrinsic Sparse Attention in Your LLMs

Open in new window