On the Convergence and Optimality of Policy Gradient for Markov Coherent Risk