Robust and Efficient Transfer Learning with Hidden-Parameter Markov Decision Processes
Killian, Taylor, Daulton, Samuel, Konidaris, George, Doshi-Velez, Finale
We introduce a new formulation of the Hidden Parameter Markov Decision Process (HiP-MDP), a framework for modeling families of related tasks using low-dimensional latent embeddings. Our new framework correctly models the joint uncertainty in the latent parameters and the state space. We also replace the original Gaussian Process-based model with a Bayesian Neural Network, enabling more scalable inference. Thus, we expand the scope of the HiP-MDP to applications with higher dimensions and more complex dynamics.
Oct-30-2017
- Country:
- North America > United States > California > Los Angeles County > Long Beach (0.04)
- Genre:
- Research Report (0.82)
- Industry:
- Technology: