DistrAttention: An Efficient and Flexible Self-Attention Mechanism on Modern GPUs