Optimistic Reinforcement Learning-Based Skill Insertions for Task and Motion Planning
Liu, Gaoyuan, de Winter, Joris, Durodie, Yuri, Steckelmacher, Denis, Nowe, Ann, Vanderborght, Bram
–arXiv.org Artificial Intelligence
Abstract--T ask and motion planning (T AMP) for robotics manipulation necessitates long-horizon reasoning involving versatile actions and skills. While deterministic actions can be crafted by sampling or optimizing with certain constraints, planning actions with uncertainty, i.e., probabilistic actions, remains a challenge for T AMP . On the contrary, Reinforcement Learning (RL) excels in acquiring versatile, yet short-horizon, manipulation skills that are robust with uncertainties. Besides the policy, a RL skill is defined with data-driven logical components that enable the skill to be deployed by symbolic planning. A plan refinement sub-routine is designed to further tackle the inevitable effect uncertainties. In the experiments, we compare our method with baseline hierarchical planning from both T AMP and RL fields and illustrate the strength of the method. The results show that by embedding RL skills, we extend the capability of T AMP to domains with probabilistic skills, and improve the planning efficiency compared to the previous methods. Reinforcement Learning (RL) empowers robots to acquire manipulation skills without human programming. However, prior works mostly tackle single-skill or short-term manipulation tasks, such as grasping [1] or peg insertion [2] or synergies between two actions [3]. The long-horizon manipulation planning remains a challenge in the RL field because of expanding state/action spaces and sparse rewards etc [4].
arXiv.org Artificial Intelligence
Oct-17-2025
- Country:
- Asia > China (0.04)
- Europe
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Belgium > Brussels-Capital Region
- North America > United States (0.04)
- Genre:
- Research Report (0.84)
- Industry:
- Education (0.46)
- Technology: