Optimal Options for Multi-Task Reinforcement Learning Under Time Constraints

Del Verme, Manuel, da Silva, Bruno Castro, Baldassarre, Gianluca

Jan-6-2020–arXiv.org Artificial Intelligence

However, even to learn to solve simple tasks it can require millions of interactions. A promising approach to improve the learning speed relies on the options framework [6] An option is a'chunk of behaviour' that is formally defined as an initiation set, establishing in which states the option is available; a policy, indicating which actions to perform in each state; and a termination condition, establishing when the option execution is terminated. RL systems can benefit from the use of options to support faster exploration and learning especially when rewards are sparse or when the solution to a problem involves recurring behaviours. An important open problem is how can an agent autonomously learn options that are useful to solve tasks drawn from a given task distribution. Recent approaches have searched options for specific optimisation problems but they have not studied how optimal options are affected by different task features such as limited learning time budgets, task rewards, initial states, and the learning algorithm used.

agent, budget, time budget, (13 more...)

arXiv.org Artificial Intelligence

Jan-6-2020

arXiv.org PDF

Add feedback

Country:
- South America > Brazil
  - Rio Grande do Sul > Porto Alegre (0.04)
- North America > United States
  - Massachusetts
    - Suffolk County > Boston (0.04)
    - Middlesex County > Cambridge (0.04)
- Europe > Italy
  - Lazio > Rome (0.05)

Genre:
- Research Report (0.85)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.71)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found