Understanding the Effect of Stochasticity in Policy Optimization

Neural Information Processing Systems 

We study the effect of stochasticity in on-policy policy optimization, and make the following four contributions.