Memory-EfficientExactAttentionwithIO-Awareness
–Neural Information Processing Systems
We argue that a missing principle is making attention algorithmsIO-aware-- accounting for reads and writes between levels of GPU memory.
Neural Information Processing Systems
Feb-9-2026, 13:24:54 GMT
- Country:
- Europe > Italy
- Calabria > Catanzaro Province
- Catanzaro (0.04)
- Tuscany > Florence (0.04)
- Calabria > Catanzaro Province
- North America
- Mexico (0.04)
- United States > California
- Santa Clara County > Palo Alto (0.04)
- Europe > Italy
- Industry:
- Government (0.46)
- Information Technology (0.46)
- Technology: