Review for NeurIPS paper: SMYRF - Efficient Attention using Asymmetric Clustering

Jan-24-2025, 00:52:40 GMT–Neural Information Processing Systems

This paper proposes a method for reducing the quadratic bottleneck of transformer architectures to O(N log N), using an asymmetric LHS clustering strategy. The paper also shows that finding an optimal assignment is NP-hard and thus, heuristic approaches must be pursued. They propose a novel type of balanced clustering algorithm to approximate attention. The method can be directly used for pre-trained models and achieves competitive/better performance with BigGAN/BERT/RoBERTa by shrinking 50% memory. There was some disagreement among reviewers about this paper, with R1 and R3 recommending solid acceptance, and R2 and R4 recommending weak reject.

asymmetric clustering, efficient attention, neurips paper, (9 more...)

Neural Information Processing Systems

Jan-24-2025, 00:52:40 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.95)