Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations
Alexander Hägele
Neural Information Processing Systems
Scale has become a main ingredient in obtaining strong machine learning models. As a result, understanding a model's scaling properties is key to effectively
Nov-19-2025, 20:38:03 GMT
- Country:
  - Europe > Italy
  - North America > United States
    - Minnesota > Hennepin County > Minneapolis (0.14)
- Genre:
  - Research Report
    - Experimental Study (1.00)
    - New Finding (1.00)
- Technology: