Portable Option Discovery for Automated Learning Transfer in Object-Oriented Markov Decision Processes
Topin, Nicholay (University of Maryland, Baltimore County) | Haltmeyer, Nicholas (University of Maryland, Baltimore County) | Squire, Shawn (University of Maryland, Baltimore County) | Winder, John (University of Maryland, Baltimore County) | desJardins, Marie (University of Maryland, Baltimore County) | MacGlashan, James (Brown University)
We introduce a novel framework for option discovery and learning transfer in complex domains represented as object-oriented Markov decision processes (OO-MDPs) [Diuk et al., 2008]. Our framework, Portable Option Discovery (POD), extends existing option discovery methods and enables transfer across related but different domains by providing an unsupervised method for finding a mapping between object-oriented domains with different state spaces. The framework also includes heuristic approaches for increasing the efficiency of the mapping process. We present the results of applying POD to Pickett and Barto's [2002] PolicyBlocks and MacGlashan's [2013] Option-Based Policy Transfer in two application domains. We show that our approach can discover options effectively, transfer options among different domains, and improve learning performance with low computational overhead.
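To make the idea of a mapping between object-oriented domains concrete, the sketch below enumerates class-preserving, one-to-one assignments of source-domain objects to target-domain objects. It is only an illustration under stated assumptions and is not POD's actual procedure: the function name `candidate_mappings`, the dictionary layout (class name to list of object names), and the example object names are all hypothetical. It does suggest why heuristics for the mapping process could matter, since the number of candidates grows quickly with the number of objects per class.

```python
from itertools import permutations, product


def candidate_mappings(source, target):
    """Yield class-preserving, one-to-one maps from source to target objects.

    Hypothetical layout: `source` and `target` map a class name to the list
    of object names of that class in the respective domain.
    """
    per_class = []
    for cls, src_objs in source.items():
        tgt_objs = target.get(cls, [])
        # permutations(tgt_objs, k) lists every ordered choice of k distinct
        # target objects, so each choice gives one injective assignment.
        per_class.append(
            [dict(zip(src_objs, chosen))
             for chosen in permutations(tgt_objs, len(src_objs))]
        )
    # Combine one assignment per class into a full object mapping.
    for combo in product(*per_class):
        merged = {}
        for part in combo:
            merged.update(part)
        yield merged


# Illustrative domains: the target has one more block than the source.
source_domain = {"agent": ["agent0"], "block": ["block0", "block1"]}
target_domain = {"agent": ["agent0"], "block": ["block0", "block1", "block2"]}

mappings = list(candidate_mappings(source_domain, target_domain))
print(len(mappings))
```

If a class has more objects in the source than in the target, this sketch yields no mapping at all; a fuller treatment would have to decide which source objects to drop, which is outside what the abstract specifies.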
Jul-15-2015