e57c6b956a6521b28495f2886ca0977a-Paper.pdf

Neural Information Processing Systems 

Attention mechanism has shown great performance and efficiency in a lot of deep learning models, in which relative position encoding plays a crucial role. However, when introducing attention to manifolds, there is no canonical local coordinate system to parameterize neighborhoods.