Training Agents using Upside-Down Reinforcement Learning

Srivastava, Rupesh Kumar, Shyam, Pranav, Mutz, Filipe, Jaśkowski, Wojciech, Schmidhuber, Jürgen

Dec-5-2019–arXiv.org Artificial Intelligence

Traditional Reinforcement Learning (RL) algorithms either predict rewards with value functions or maximize them using policy search. We study an alternative: Upside-Down Reinforcement Learning (Upside-Down RL or UDRL), that solves RL problems primarily using supervised learning techniques. Many of its main principles are outlined in a companion report [34]. Here we present the first concrete implementation of UDRL and demonstrate its feasibility on certain episodic learning problems. Experimental results show that its performance can be surprisingly competitive with, and even exceed that of traditional baseline algorithms developed over decades of research.

agent, algorithm, behavior function, (15 more...)

arXiv.org Artificial Intelligence

Dec-5-2019

arXiv.org PDF

Add feedback

Country:
- South America > Brazil (0.04)
- North America > United States
  - Texas > Travis County > Austin (0.14)

Genre:
- Research Report > New Finding (0.34)

Industry:
- Education (0.34)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Reinforcement Learning (1.00)
  - Neural Networks > Deep Learning (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found