Understanding the Effect of Stochasticity in Policy Optimization Jincheng Mei