Expert-Supervised Reinforcement Learning for Offline Policy Learning and Evaluation
–Neural Information Processing Systems
With increasing success in reinforcement learning (RL), there is broad interest in applying these methods to real-world settings. This has brought exciting progress in offline RL and off-policy policy evaluation (OPPE).
Aug-16-2025, 19:21:37 GMT
- Country:
- Europe > United Kingdom
- England > Greater London > London (0.04)
- North America
- Canada (0.04)
- United States > Massachusetts
- Middlesex County > Cambridge (0.14)
- Genre:
- Research Report (0.67)
- Industry:
- Technology: