MonarchAttention: Zero-Shot Conversion to Fast, Hardware-Aware Structured Attention

Open in new window