Understanding the impact of entropy on policy optimization