Randomized Matrix Sketching for Neural Network Training and Gradient Monitoring