Cumulative Learning Rate Adaptation: Revisiting Path-Based Schedules for SGD and Adam

Open in new window