Luna: LinearUnifiedNestedAttention
–Neural Information Processing Systems
The quadratic computational and memory complexities of the Transformer'sattention mechanism have limited its scalability for modeling long sequences.
Neural Information Processing Systems
Feb-7-2026, 14:13:44 GMT
- Country:
- Technology: