Unlocking Tokens as Data Points for Generalization Bounds on Larger Language Models
Neural Information Processing Systems
With Monarch matrices, Kronecker factorizations, and post-training quantization, we achieve non-vacuous generalization bounds for LLMs as large as LLaMA2-70B.
Oct-9-2025, 18:50:18 GMT
- Country:
- Asia > Japan
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- Genre:
- Research Report > Experimental Study (1.00)
- Industry:
- Education (0.68)
- Health & Medicine > Therapeutic Area
- Neurology (1.00)
- Information Technology (0.67)
- Technology: