Convergence Properties of Natural Gradient Descent for Minimizing KL Divergence