Surrogate Objectives for Batch Policy Optimization in One-step Decision Making

Open in new window