An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding

Mar-20-2026, 23:27:54 GMT–Neural Information Processing Systems

Recently, many methods have been developed to extend the context length of pre-trained large language models (LLMs), but they often require fine-tuning at the target length ($\gg4K$) and struggle to effectively utilize information from the middle part of the context.

artificial intelligence, large language model, natural language, (11 more...)

Neural Information Processing Systems

Mar-20-2026, 23:27:54 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.88)