FastAttention: Extend FlashAttention2 to NPUs and Low-resource GPUs
