Reviews: On the Ineffectiveness of Variance Reduced Optimization for Deep Learning
–Neural Information Processing Systems
I'm glad you commented on the learning rate selection, because this was a major point of our discussion. The main reason I can't increase my score is that many definitions, explanations and experiment details are missing, making it extremely hard to evaluate the real value of your experiments. This was additionally complicated by the fact that you didn't provide your code when submitting the paper. I hope that you will do a major revision, for example include a section in supplementary material with all experiments details. Just in case, here are my suggestions for some extra experiments: 1. Measure and plot the effect of data augmentation on bias of the gradient.
Neural Information Processing Systems
Jan-25-2025, 07:09:50 GMT
- Technology: