Automatic Discovery and Transfer of Task Hierarchies in Reinforcement Learning
A principal one among them is the existence of multiple domains that share the same underlying causal structure for actions. We describe an approach that exploits this shared causal structure to discover a hierarchical task structure in a source domain, which in turn speeds up learning of task execution knowledge in a new target domain. Our approach is theoretically justified and compares favorably to manually designed task hierarchies in learning efficiency in the target domain. We demonstrate that causally motivated task hierarchies transfer more robustly than other kinds of detailed knowledge that depend on the idiosyncrasies of the source domain and are hence less transferable. These domains are complex, and good performance requires selecting long chains of actions to achieve subgoals needed for ultimate success.
Jan-4-2018, 08:46:46 GMT
- Technology: