Active Reinforcement Learning: Observing Rewards at a Cost

Krueger, David, Leike, Jan, Evans, Owain, Salvatier, John

Nov-12-2020–arXiv.org Artificial Intelligence

Active reinforcement learning (ARL) is a variant on reinforcement learning where the agent does not observe the reward unless it chooses to pay a query cost c > 0. The central question of ARL is how to quantify the long-term value of reward information. Even in multi-armed bandits, computing the value of this information is intractable and we have to rely on heuristics. We propose and evaluate several heuristic approaches for ARL in multi-armed bandits and (tabular) Markov decision processes, and discuss and illustrate some challenging aspects of the ARL problem.

agent, algorithm, query, (12 more...)

arXiv.org Artificial Intelligence

Nov-12-2020

arXiv.org PDF

Add feedback

Country:
- North America
  - United States (0.04)
  - Canada > Quebec
    - Montreal (0.14)
- Europe > United Kingdom
  - England > Oxfordshire > Oxford (0.05)

Genre:
- Research Report > New Finding (0.46)

Technology:
- Information Technology
  - Data Science > Data Mining
    - Big Data (0.92)
  - Artificial Intelligence > Machine Learning
    - Reinforcement Learning (1.00)
    - Learning Graphical Models > Undirected Networks
      - Markov Models (0.34)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found