Global Policy Construction in Modular Reinforcement Learning

Zhang, Ruohan (The University of Texas at Austin) | Song, Zhao (The University of Texas at Austin) | Ballard, Dana H. (The University of Texas at Austin)

AAAI Conferences 

We propose a modular reinforcement learning algorithm which decomposes a Markov decision process into independent modules. Each module is trained using Sarsa(lambda). We introduce three algorithms for forming global policy from modules policies, and demonstrate our results using a 2D grid world.

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found