Getting ViT in Shape: Scaling Laws for Compute-Optimal Model Design Anonymous Author(s) Affiliation Address email Scaling laws have been recently employed to derive compute-optimal model size
–Neural Information Processing Systems
Neural Information Processing Systems
May-28-2025, 20:58:34 GMT
- Country:
- Asia > Middle East
- Israel (0.14)
- Oceania > Australia (0.14)
- Asia > Middle East
- Genre:
- Research Report > New Finding (0.67)
- Technology: