Theoretical Investigation of Adafactor for Non-Convex Smooth Optimization

Open in new window