Streaming Attention Approximation via Discrepancy Theory
