LOTFormer: Doubly-Stochastic Linear Attention via Low-Rank Optimal Transport
