Tractable structured natural gradient descent using local parameterizations
Lin, Wu, Nielsen, Frank, Khan, Mohammad Emtiyaz, Schmidt, Mark
Natural-gradient descent on structured parameter spaces (e.g., low-rank covariances) is computationally challenging due to complicated inverse Fisher-matrix computations. We address this issue for optimization, inference, and search problems by using \emph{local-parameter coordinates}. Our method generalizes an existing evolutionary-strategy method, recovers Newton and Riemannian-gradient methods as special cases, and also yields new tractable natural-gradient algorithms for learning flexible covariance structures of Gaussian and Wishart-based distributions. We show results on a range of applications on deep learning, variational inference, and evolution strategies. Our work opens a new direction for scalable structured geometric methods via local parameterizations.
Feb-15-2021
- Country:
- Asia > Japan (0.04)
- North America > Canada
- British Columbia (0.04)
- Genre:
- Research Report (0.63)
- Technology: