DAPE: Data-Adaptive Positional Encoding for Length Extrapolation

Neural Information Processing Systems 

Positional encoding plays a crucial role in transformers, significantly impacting model performance and length generalization. Prior research has introduced absolute positional encoding (APE) and relative positional encoding (RPE) to distinguish token positions in given sequences.