LiFT: Unsupervised Reinforcement Learning with Foundation Models as Teachers

Nam, Taewook, Lee, Juyong, Zhang, Jesse, Hwang, Sung Ju, Lim, Joseph J., Pertsch, Karl

Dec-14-2023–arXiv.org Artificial Intelligence

We propose a framework that leverages foundation models as teachers, guiding a reinforcement learning agent to acquire semantically meaningful behavior without human feedback. In our framework, the agent receives task instructions grounded in a training environment from large language models. Then, a vision-language model guides the agent in learning the multi-task language-conditioned policy by providing reward feedback. We demonstrate that our method can learn semantically meaningful skills in a challenging open-ended MineDojo environment while prior unsupervised skill discovery methods struggle. Additionally, we discuss observed challenges of using off-the-shelf foundation models as teachers and our efforts to address them.

agent, arxiv preprint arxiv, instruction, (14 more...)

arXiv.org Artificial Intelligence

Dec-14-2023

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - California > Alameda County > Berkeley (0.04)
- Europe > Romania
  - Sud - Muntenia Development Region > Giurgiu County > Giurgiu (0.04)

Genre:
- Research Report (0.50)

Industry:
- Education (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language (1.00)
  - Machine Learning > Reinforcement Learning (1.00)