ProxyAttn: Guided Sparse Attention via Representative Heads

Open in new window