A Tale of Two Geometries: Adaptive Optimizers and Non-Euclidean Descent

Open in new window