Review for NeurIPS paper: Minimax Value Interval for Off-Policy Evaluation and Policy Optimization
