An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding

Neural Information Processing Systems 

Recently, many methods have been developed to extend the context length of pre-trained large language models (LLMs), but they often require fine-tuning at the target length ($\gg4K$) and struggle to effectively utilize information from the middle part of the context.