Efficient Inference and Exploration for Reinforcement Learning

Oct-11-2019–arXiv.org Machine Learning

Despite an ever growing literature on reinforcement learning algorithms and applications, much less is known about their statistical inference. In this paper, we investigate the large sample behaviors of the Q-value estimates with closed-form characterizations of the asymptotic variances. This allows us to efficiently construct confidence regions for Q-value and optimal value functions, and to develop policies to minimize their estimation errors. This also leads to a policy exploration strategy that relies on estimating the relative discrepancies among the Q estimates. Numerical experiments show superior performances of our exploration strategy than other benchmark approaches.

null, optimization problem, upstream oil & gas, (19 more...)

arXiv.org Machine Learning

Oct-11-2019

arXiv.org PDF

Add feedback

Genre:
- Research Report (1.00)

Industry:
- Energy > Oil & Gas > Upstream (0.54)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Reinforcement Learning (1.00)
  - Representation & Reasoning > Optimization (0.93)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found