Soft Adaptive Policy Optimization