A Benchmark for Evaluating Language Model Fit
Neural Information Processing Systems
Evaluations of language models (LMs) commonly report perplexity on monolithic data held out from training. Implicitly or explicitly, this data is composed of domains, that is, varying distributions of language.
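To illustrate why a single perplexity over monolithic held-out data can hide differences between domains, here is a minimal sketch. It assumes per-token negative log-likelihoods (in nats) are already available and tagged with a domain label. The function names, the domain labels, and the toy loss values are hypothetical and are not taken from the paper.

```python
import math
from collections import defaultdict


def perplexity(nlls):
    """Perplexity = exp(mean per-token negative log-likelihood, in nats)."""
    return math.exp(sum(nlls) / len(nlls))


def perplexity_by_domain(records):
    """Group (domain, per-token NLL) pairs by domain and score each group."""
    by_domain = defaultdict(list)
    for domain, nll in records:
        by_domain[domain].append(nll)
    return {domain: perplexity(values) for domain, values in by_domain.items()}


# Toy, made-up token losses for two hypothetical domains (not real results).
records = [("news", 2.0), ("news", 2.0), ("news", 2.0), ("forum", 4.0)]

overall = perplexity([nll for _, nll in records])  # one monolithic number
per_domain = perplexity_by_domain(records)         # one number per domain
```

In this toy setup the monolithic value falls between the two per-domain values, so reporting it alone would not show that the hypothetical "forum" tokens are modeled much worse than the "news" tokens.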
May-30-2025, 02:41:32 GMT
- Country:
- Asia > Middle East
- UAE (0.14)
- Europe (1.00)
- North America > United States
- Texas (0.14)
- Genre:
- Research Report (1.00)
- Industry:
- Health & Medicine (1.00)
- Leisure & Entertainment > Games (0.45)
- Media > News (0.46)
- Technology:
- Information Technology
- Artificial Intelligence
- Machine Learning > Neural Networks (0.67)
- Natural Language
- Chatbot (0.46)
- Large Language Model (0.68)
- Communications > Social Media (1.00)
- Software (1.00)