Stacking Y our Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training
–Neural Information Processing Systems
Neural Information Processing Systems
Feb-8-2026, 06:37:26 GMT
- Country:
- Asia
- China > Hong Kong (0.04)
- Middle East > Jordan (0.04)
- Asia
- Genre:
- Research Report
- Experimental Study (0.93)
- New Finding (0.93)
- Research Report
- Industry:
- Information Technology (0.46)
- Technology: