Pretrained Bayesian Non-parametric Knowledge Prior in Robotic Long-Horizon Reinforcement Learning

Meng, Yuan, Yao, Xiangtong, Chen, Kejia, Wu, Yansong, Zhang, Liding, Bing, Zhenshan, Knoll, Alois

Mar-27-2025–arXiv.org Artificial Intelligence

Pretrained Bayesian Non-parametric Knowledge Prior in Robotic Long-Horizon Reinforcement Learning Y uan Meng 1, Xiangtong Y ao 1, Kejia Chen 1, Y ansong Wu 1, Liding Zhang 1, Zhenshan Bing 2,, and Alois Knoll 1 IEEE fellow Abstract -- Reinforcement learning (RL) methods typically learn new tasks from scratch, often disregarding prior knowledge that could accelerate the learning process. While some methods incorporate previously learned skills, they usually rely on a fixed structure, such as a single Gaussian distribution, to define skill priors. This rigid assumption can restrict the diversity and flexibility of skills, particularly in complex, long-horizon tasks. In this work, we introduce a method that models potential primitive skill motions as having non-parametric properties with an unknown number of underlying features. We utilize a Bayesian non-parametric model, specifically Dirichlet Process Mixtures, enhanced with birth and merge heuristics, to pre-train a skill prior that effectively captures the diverse nature of skills. Additionally, the learned skills are explicitly trackable within the prior space, enhancing interpretability and control. Our findings show that a richer, non-parametric representation of skill priors significantly improves both the learning and execution of challenging robotic tasks. All data, code, and videos are available at https://ghiara.github.io/HELIOS/.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

arXiv.org Artificial Intelligence

Mar-27-2025

arXiv.org PDF

Add feedback

Country:
- Europe > Germany
  - Bavaria > Upper Bavaria > Munich (0.04)
- Asia
  - Middle East > Jordan (0.04)
  - China > Jiangsu Province
    - Nanjing (0.04)

Genre:
- Research Report > New Finding (0.54)

Industry:
- Education (0.87)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Statistical Learning (1.00)
  - Reinforcement Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found