On Proximal Policy Optimization's Heavy-tailed Gradients