A Review of Off-Policy Evaluation in Reinforcement Learning
Uehara, Masatoshi, Shi, Chengchun, Kallus, Nathan
–arXiv.org Artificial Intelligence
Reinforcement learning (RL) is one of the most vibrant research frontiers in machine learning and has been recently applied to solve a number of challenging problems. In this paper, we primarily focus on off-policy evaluation (OPE), one of the most fundamental topics in RL. In recent years, a number of OPE methods have been developed in the statistics and computer science literature. We provide a discussion on the efficiency bound of OPE, some of the existing state-of-the-art OPE methods, their statistical properties and some other related research directions that are currently actively explored.
arXiv.org Artificial Intelligence
Dec-12-2022
- Country:
- North America > United States
- Washington > King County
- Seattle (0.04)
- New York > New York County
- New York City (0.14)
- Massachusetts > Middlesex County
- Florida > Palm Beach County
- Boca Raton (0.04)
- Washington > King County
- Europe > United Kingdom
- England
- Cambridgeshire > Cambridge (0.14)
- Oxfordshire > Oxford (0.04)
- Greater London > London (0.04)
- England
- North America > United States
- Genre:
- Research Report
- Experimental Study (1.00)
- Strength High (0.92)
- Research Report
- Industry:
- Health & Medicine > Therapeutic Area > Endocrinology > Diabetes (0.67)
- Technology: