Oscillation-Reduced MXFP4 Training for Vision Transformers
