Calibrating the Learning Rate for Adaptive Gradient Methods to Improve Generalization Performance

Open in new window