Hyperbolic Feature Augmentation via Distribution Estimation and Infinite Sampling on Manifolds Zhi Gao 1, Y uwei Wu

Neural Information Processing Systems 

We use the SGD optimizer in the three training stage.