HyperAttention: Long-context Attention in Near-Linear Time
