Boosting Trust Region Policy Optimization by Normalizing Flows Policy