Beyond Length: Quantifying Long-Range Information for Long-Context LLM Pretraining Data
