Pretrained Bayesian Non-parametric Knowledge Prior in Robotic Long-Horizon Reinforcement Learning
Meng, Yuan, Yao, Xiangtong, Chen, Kejia, Wu, Yansong, Zhang, Liding, Bing, Zhenshan, Knoll, Alois
–arXiv.org Artificial Intelligence
Pretrained Bayesian Non-parametric Knowledge Prior in Robotic Long-Horizon Reinforcement Learning Y uan Meng 1, Xiangtong Y ao 1, Kejia Chen 1, Y ansong Wu 1, Liding Zhang 1, Zhenshan Bing 2,, and Alois Knoll 1 IEEE fellow Abstract -- Reinforcement learning (RL) methods typically learn new tasks from scratch, often disregarding prior knowledge that could accelerate the learning process. While some methods incorporate previously learned skills, they usually rely on a fixed structure, such as a single Gaussian distribution, to define skill priors. This rigid assumption can restrict the diversity and flexibility of skills, particularly in complex, long-horizon tasks. In this work, we introduce a method that models potential primitive skill motions as having non-parametric properties with an unknown number of underlying features. We utilize a Bayesian non-parametric model, specifically Dirichlet Process Mixtures, enhanced with birth and merge heuristics, to pre-train a skill prior that effectively captures the diverse nature of skills. Additionally, the learned skills are explicitly trackable within the prior space, enhancing interpretability and control. Our findings show that a richer, non-parametric representation of skill priors significantly improves both the learning and execution of challenging robotic tasks. All data, code, and videos are available at https://ghiara.github.io/HELIOS/.
arXiv.org Artificial Intelligence
Mar-27-2025
- Country:
- Asia
- China > Jiangsu Province
- Nanjing (0.04)
- Middle East > Jordan (0.04)
- China > Jiangsu Province
- Europe > Germany
- Bavaria > Upper Bavaria > Munich (0.04)
- Asia
- Genre:
- Research Report > New Finding (0.54)
- Industry:
- Education (0.87)
- Technology: