Fast Multipole Attention: A Divide-and-Conquer Attention Mechanism for Long Sequences

Open in new window