Model Selection in Batch Policy Optimization