INT-FlashAttention: Enabling Flash Attention for INT8 Quantization
