Optimistic Proximal Policy Optimization