A Review of Off-Policy Evaluation in Reinforcement Learning

Uehara, Masatoshi, Shi, Chengchun, Kallus, Nathan

Dec-12-2022–arXiv.org Artificial Intelligence

Reinforcement learning (RL) is one of the most vibrant research frontiers in machine learning and has been recently applied to solve a number of challenging problems. In this paper, we primarily focus on off-policy evaluation (OPE), one of the most fundamental topics in RL. In recent years, a number of OPE methods have been developed in the statistics and computer science literature. We provide a discussion on the efficiency bound of OPE, some of the existing state-of-the-art OPE methods, their statistical properties and some other related research directions that are currently actively explored.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

arXiv.org Artificial Intelligence

Dec-12-2022

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Washington > King County
    - Seattle (0.04)
  - New York > New York County
    - New York City (0.14)
  - Massachusetts > Middlesex County
    - Cambridge (0.04)
    - Belmont (0.04)
  - Florida > Palm Beach County
    - Boca Raton (0.04)
- Europe > United Kingdom
  - England
    - Cambridgeshire > Cambridge (0.14)
    - Oxfordshire > Oxford (0.04)
    - Greater London > London (0.04)

Genre:
- Research Report
  - Experimental Study (1.00)
  - Strength High (0.92)

Industry:
- Health & Medicine > Therapeutic Area > Endocrinology > Diabetes (0.67)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Statistical Learning (1.00)
  - Reinforcement Learning (1.00)
  - Learning Graphical Models > Undirected Networks
    - Markov Models (0.47)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found