Surrogate Objectives for Batch Policy Optimization in One-step Decision Making