Reinforcement Learning, Bit by Bit

Lu, Xiuyuan, Van Roy, Benjamin, Dwaracherla, Vikranth, Ibrahimi, Morteza, Osband, Ian, Wen, Zheng

Mar-14-2021–arXiv.org Artificial Intelligence

Reinforcement learning agents have demonstrated remarkable achievements in simulated environments. Data efficiency poses an impediment to carrying this success over to real environments. The design of data-efficient agents calls for a deeper understanding of information acquisition and representation. We develop concepts and establish a regret bound that together offer principled guidance. The bound sheds light on questions of what information to seek, how to seek that information, and what information to retain. To illustrate concepts, we design simple agents that build on them and present computational results that demonstrate improvements in data efficiency. Other learning paradigms are about minimization; reinforcement learning is about maximization.

agent, information, value function, (16 more...)

arXiv.org Artificial Intelligence

Mar-14-2021

arXiv.org PDF

Add feedback

Country:
- Oceania > Australia
  - New South Wales > Sydney (0.04)
- North America > United States
  - Massachusetts
    - Middlesex County > Belmont (0.04)
    - Hampshire County > Amherst (0.04)
- Europe
  - Kosovo > District of Gjilan
    - Kamenica (0.04)
  - France > Hauts-de-France
    - Nord > Lille (0.04)
- Asia > Middle East
  - Jordan (0.04)

Genre:
- Research Report (0.81)

Industry:
- Education (0.67)
- Leisure & Entertainment > Games
  - Computer Games (0.92)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning
    - Agents (1.00)
    - Uncertainty > Bayesian Inference (0.92)
  - Machine Learning
    - Reinforcement Learning (1.00)
    - Learning Graphical Models > Directed Networks
      - Bayesian Learning (0.92)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found