Cutting Some Slack for SGD with Adaptive Polyak Stepsizes
