Quantizable Transformers: Removing Outliers by Helping Attention Heads Do Nothing
–Neural Information Processing Systems
Due to their size, the capability of Transformer networks has increased tremendously, but at the cost of a significant increase in required compute.
Feb-17-2026, 20:01:18 GMT
- Country:
- Asia
- China > Hong Kong (0.04)
- Middle East
- Europe
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Italy > Tuscany
- Florence (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- North America
- Dominican Republic (0.04)
- United States > Minnesota
- Hennepin County > Minneapolis (0.14)
- Genre:
- Research Report (0.46)
- Technology: