DoWG Unleashed: An Efficient Universal Parameter-Free Gradient Descent Method

Neural Information Processing Systems 

This paper proposes DoWG (Distance over Weighted Gradients), a new, easy-to-implement, parameter-free gradient-based optimizer.
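To illustrate the "easy-to-implement" claim, here is a minimal Python sketch of the unconstrained DoWG iteration. The update rule is not spelled out in this summary; it is written here as I understand it from the DoWG paper: a running distance estimate r_t = max(||x_t - x_0||, r_{t-1}), a weighted gradient sum v_t = v_{t-1} + r_t^2 ||g_t||^2, and a step size r_t^2 / sqrt(v_t). The function name `dowg`, the parameters `steps` and `r_eps`, their default values, and the toy quadratic are assumptions made for illustration only, and the projection step needed for constrained problems is omitted.

```python
import math


def dowg(grad, x0, steps=200, r_eps=1e-4):
    """Sketch of unconstrained DoWG; names and defaults are illustrative assumptions.

    grad  : callable mapping a list of floats to its gradient (list of floats)
    x0    : starting point (list of floats)
    r_eps : small positive initial distance estimate
    """
    x = list(x0)
    r = r_eps   # running estimate of the distance travelled from x0
    v = 0.0     # running sum of distance-weighted squared gradient norms
    for _ in range(steps):
        g = grad(x)
        dist = math.sqrt(sum((xi - x0i) ** 2 for xi, x0i in zip(x, x0)))
        r = max(r, dist)
        v += r * r * sum(gi * gi for gi in g)
        if v == 0.0:
            break  # every gradient so far was zero, so x0 is already stationary
        eta = r * r / math.sqrt(v)  # "distance over weighted gradients"
        x = [xi - eta * gi for xi, gi in zip(x, g)]
    return x


# Toy usage (hypothetical problem): minimize 0.5 * ||x - target||^2.
target = [3.0, -2.0]


def quadratic_grad(x):
    return [xi - ti for xi, ti in zip(x, target)]


print(dowg(quadratic_grad, [0.0, 0.0]))
```

No learning rate is passed in: the only input besides the starting point is `r_eps`, which just seeds the distance estimate. The sketch returns the last iterate for simplicity, which is a choice made here rather than something stated in this summary.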