SCOUT: Toward Sub-Quadratic Attention via Segment Compression for Optimized Utility in Transformers

Open in new window