Gradient Informed Proximal Policy Optimization