Unveil Benign Overfitting for Transformer in Vision: Training Dynamics, Convergence, and Generalization