On the Structure of Floating-Point Noise in Batch-Invariant GPU Matrix Multiplication
