SoftTreeMax: Policy Gradient with Tree Search

Open in new window