Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies
Neural Information Processing Systems
Research on scaling large language models (LLMs) has primarily focused on model parameters and training data size, overlooking the role of vocabulary size.
Oct-11-2025, 00:42:32 GMT
- Country:
- Asia
- Europe
- North America
- Dominican Republic (0.04)
- United States > Ohio (0.04)
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (1.00)
- Industry:
- Government (0.67)
- Information Technology (0.67)
- Technology: