Geometry and convergence of natural policy gradient methods