Online Multi-Task Gradient Temporal-Difference Learning

Sreenivasan, Vishnu Purushothaman (University of Pennsylvania) | Ammar, Haitham Bou (University of Pennsylvania) | Eaton, Eric (University of Pennsylvania)

Jul-14-2014–AAAI Conferences

We develop an online multi-task formulation of model-based gradient temporal-difference (GTD) reinforcement learning. Our approach enables an autonomous RL agent to accumulate knowledge over its lifetime and efficiently share this knowledge between tasks to accelerate learning. Rather than learning a policy for a reinforcement learning task tabula rasa, as in standard GTD, our approach rapidly learns a high-performance policy by building upon the agent's previously learned knowledge. Our preliminary results on controlling different mountain car tasks demonstrate that GTD-ELLA significantly improves learning over standard GTD(0).
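For readers unfamiliar with the baseline named above, the following is a minimal sketch of a single GTD(0) update with linear value-function approximation, in its commonly stated policy-evaluation form. It is not the GTD-ELLA method, whose details this abstract does not give. The function name `gtd0_update`, the helper `dot`, and the step sizes `alpha`, `beta` and discount `gamma` are illustrative assumptions, not values or names from the paper.

```python
def dot(a, b):
    """Inner product of two equal-length lists of floats."""
    return sum(x * y for x, y in zip(a, b))


def gtd0_update(theta, u, phi, reward, phi_next,
                alpha=0.05, beta=0.05, gamma=0.9):
    """Apply one GTD(0) step for a single transition (hypothetical helper).

    theta    : weights of the linear value estimate, V(s) ~ theta . phi(s)
    u        : auxiliary weights tracking the expected TD update E[delta * phi]
    phi      : feature vector of the current state
    phi_next : feature vector of the next state
    Returns (new_theta, new_u, td_error). Step sizes are assumed values.
    """
    # TD error under the current value estimate.
    delta = reward + gamma * dot(theta, phi_next) - dot(theta, phi)

    # The main weights move along (phi - gamma * phi_next), scaled by phi . u,
    # using the auxiliary weights from before this step.
    phi_u = dot(phi, u)
    new_theta = [t + alpha * (p - gamma * pn) * phi_u
                 for t, p, pn in zip(theta, phi, phi_next)]

    # The auxiliary weights move toward the sampled delta * phi.
    new_u = [ui + beta * (delta * p - ui) for ui, p in zip(u, phi)]

    return new_theta, new_u, delta
```

On the first transition with zero-initialised weights only `u` changes; `theta` starts to move once `u` carries a non-zero estimate. A multi-task variant would additionally have to share learned structure across tasks, which this sketch does not attempt.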

artificial intelligence, machine learning, reinforcement learning
