CRISP: Curriculum inducing Primitive Informed Subgoal Prediction
Singh, Utsav, Namboodiri, Vinay P
–arXiv.org Artificial Intelligence
Hierarchical reinforcement learning is a promising approach that uses temporal abstraction to solve complex long horizon problems. However, simultaneously learning a hierarchy of policies is unstable as it is challenging to train higher-level policy when the lower-level primitive is non-stationary. In this paper, we propose a novel hierarchical algorithm CRISP to generate a curriculum of achievable subgoals for evolving lower-level primitives using reinforcement learning and imitation learning. The lower level primitive periodically performs data relabeling on a handful of expert demonstrations using our primitive informed parsing approach to handle non-stationarity. Since our approach uses a handful of expert demonstrations, it is suitable for most robotic control tasks. Experimental evaluations on complex robotic maze navigation and robotic manipulation environments show that inducing hierarchical curriculum learning significantly improves sample efficiency, and results in efficient goal conditioned policies for solving temporally extended tasks. We perform real world robotic experiments on complex manipulation tasks and demonstrate that CRISP consistently outperforms the baselines.
arXiv.org Artificial Intelligence
Sep-23-2023
- Country:
- Asia
- India > Uttar Pradesh
- Kanpur (0.04)
- Middle East > Jordan (0.04)
- India > Uttar Pradesh
- Europe
- North America > United States
- California
- San Diego County > San Diego (0.04)
- San Francisco County > San Francisco (0.14)
- New York > New York County
- New York City (0.04)
- California
- Asia
- Genre:
- Research Report (1.00)
- Technology: