Unsupervised skill discovery with contrastive intrinsic control

Apr-1-2022, 14:00:00 GMT–AIHub

Unsupervised Reinforcement Learning (RL), where RL agents pre-train with self-supervised rewards, is an emerging paradigm for developing RL agents that are capable of generalization. Recently, we released the Unsupervised RL Benchmark (URLB) which we covered in a previous post. A surprising finding was that competence-based algorithms significantly underperformed other categories. In this post we will demystify what has been holding back competence-based methods and introduce Contrastive Intrinsic Control (CIC), a new competence-based algorithm that is the first to achieve leading results on URLB. To recap, competence-based methods (which we will cover in detail) maximize the mutual information between states and skills (e.g.

algorithm, contrastive intrinsic control, mutual information, (11 more...)

AIHub

Apr-1-2022, 14:00:00 GMT

News Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.94)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found