Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies
–Neural Information Processing Systems
Research on scaling large language models (LLMs) has primarily focused on model parameters and training data size, overlooking the role of vocabulary size.
Neural Information Processing Systems
Feb-18-2026, 05:23:05 GMT
- Country:
- North America
- United States > Ohio (0.04)
- Dominican Republic (0.04)
- Europe
- Asia
- North America
- Genre:
- Research Report
- New Finding (1.00)
- Experimental Study (1.00)
- Research Report
- Industry:
- Information Technology (0.67)
- Government (0.67)
- Technology: