Automatic Discovery and Transfer of Task Hierarchies in Reinforcement Learning

Mehta, Neville (Oregon State University) | Ray, Soumya (Case Western Reserve University) | Tadepalli, Prasad (Oregon State University) | Dietterich, Thomas (Oregon State University)

AI Magazine 

A principal one among them is the existence of multiple domains that share the same underlying causal structure for actions. We describe an approach that exploits this shared causal structure to discover a hierarchical task structure in a source domain, which in turn speeds up learning of task execution knowledge in a new target domain. Our approach is theoretically justified and compares favorably to manually designed task hierarchies in learning efficiency in the target domain. We demonstrate that causally motivated task hierarchies transfer more robustly than other kinds of detailed knowledge that depend on the idiosyncrasies of the source domain and are hence less transferable.