KERPLE: Kernelized Relative Positional Embedding for Length Extrapolation T a-Chung Chi

Neural Information Processing Systems 

RPEs effectively model the relative distance among tokens and enable length extrapolation.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found