Hardware-aligned Hierarchical Sparse Attention for Efficient Long-term Memory Access
