Unsupervised Skill Discovery via Recurrent Skill Training

Jan-19-2025, 09:00:52 GMT–Neural Information Processing Systems

Being able to discover diverse useful skills without external reward functions is beneficial in reinforcement learning research. Previous unsupervised skill discovery approaches mainly train different skills in parallel. Although impressive results have been provided, we found that parallel training procedure can sometimes block exploration when the state visited by different skills overlap, which leads to poor state coverage and restricts the diversity of learned skills. In this paper, we take a deeper look into this phenomenon and propose a novel framework to address this issue, which we call Recurrent Skill Training (ReST). Instead of training all the skills in parallel, ReST trains different skills one after another recurrently, along with a state coverage based intrinsic reward.

recurrent skill training, state coverage, unsupervised skill discovery, (1 more...)

Neural Information Processing Systems

Jan-19-2025, 09:00:52 GMT

Conferences Web Page

Add feedback

Genre:
- Instructional Material (0.65)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.44)