Will it Blend? Composing Value Functions in Reinforcement Learning

van Niekerk, Benjamin, James, Steven, Earle, Adam, Rosman, Benjamin

Jul-12-2018–arXiv.org Machine Learning

An important property for lifelong-learning agents is the ability to combine existing skills to solve unseen tasks. In general, however, it is unclear how to compose skills in a principled way. We provide a "recipe" for optimal value function composition in entropy-regularised reinforcement learning (RL) and then extend this to the standard RL setting. Composition is demonstrated in a video game environment, where an agent with an existing library of policies is able to solve new tasks without the need for further learning.

machine learning, q-function, reinforcement learning, (16 more...)

arXiv.org Machine Learning

Jul-12-2018

arXiv.org PDF

Add feedback

Country:
- Europe > Sweden
  - Stockholm > Stockholm (0.04)
- Africa > South Africa
  - Gauteng
    - Pretoria (0.04)
    - Johannesburg (0.04)

Genre:
- Research Report (0.64)
- Instructional Material (0.49)

Industry:
- Leisure & Entertainment > Games > Computer Games (0.54)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found