Statistically Efficient Advantage Learning for Offline Reinforcement Learning in Infinite Horizons
Shi, Chengchun, Luo, Shikai, Le, Yuan, Zhu, Hongtu, Song, Rui
–arXiv.org Artificial Intelligence
Reinforcement learning (RL, see Sutton and Barto, 2018, for an overview) is concerned with how intelligence agents learn and take actions in an unknown environment in order to maximize the cumulative reward that it receives. It has been arguably one of the most vibrant research frontiers in machine learning over the last few years. According to Google Scholar, over 40K scientific articles have been published in 2020 with the phrase "reinforcement learning". Over 100 papers on RL were accepted for presentation at ICML 2021, a premier conference in the machine learning area, accounting for more than 10% of the accepted papers in total. RL algorithms have been applied in a wide variety of real applications, including games (Silver et al., 2016), robotics (Kormushev et al., 2013), healthcare (Komorowski et al., 2018), bidding (Jin et al., 2018), ridesharing (Xu et al., 2018) and automated driving (de Haan et al., 2019), to name a few. This paper is partly motivated by developing statistical learning methodologies in offline RL domains such as mobile health (mHealth).
arXiv.org Artificial Intelligence
Jul-26-2022
- Country:
- Asia > China
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- North America > United States
- California > San Diego County
- San Diego (0.04)
- Massachusetts
- Middlesex County > Cambridge (0.04)
- Suffolk County > Boston (0.04)
- New York (0.04)
- North Carolina (0.04)
- California > San Diego County
- Genre:
- Research Report > New Finding (0.45)
- Industry:
- Health & Medicine
- Health Care Technology (0.67)
- Therapeutic Area > Endocrinology
- Diabetes (1.00)
- Transportation > Ground
- Road (0.67)
- Health & Medicine
- Technology: