Robust and Efficient Transfer Learning with Hidden Parameter Markov Decision Processes
Killian, Taylor W., Daulton, Samuel, Konidaris, George, Doshi-Velez, Finale
–Neural Information Processing Systems
We introduce a new formulation of the Hidden Parameter Markov Decision Process (HiP-MDP), a framework for modeling families of related tasks using low-dimensional latent embeddings. We also replace the original Gaussian Process-based model with a Bayesian Neural Network, enabling more scalable inference. Thus, we expand the scope of the HiP-MDP to applications with higher dimensions and more complex dynamics. Papers published at the Neural Information Processing Systems Conference.
Neural Information Processing Systems
Feb-14-2020, 18:43:08 GMT
- Technology: