Divergence-Augmented Policy Optimization