MSWA: Refining Local Attention with Multi-ScaleWindow Attention

Open in new window