Faster Neighborhood Attention: Reducing the O(n) Cost of Self Attention at the Threadblock Level Ali Hassani 1, Wen-mei Hwu

Open in new window