Trust Region-Guided Proximal Policy Optimization