Surrogate Objectives for Batch Policy Optimization in One-step Decision Making
Minmin Chen, Ramki Gummadi, Chris Harris, Dale Schuurmans
–Neural Information Processing Systems
Neural Information Processing Systems
Oct-3-2025, 03:19:27 GMT
Minmin Chen, Ramki Gummadi, Chris Harris, Dale Schuurmans
–Neural Information Processing Systems
Neural Information Processing Systems
Oct-3-2025, 03:19:27 GMT