Learning to Solve Voxel Building Embodied Tasks from Pixels and Natural Language Instructions

Skrynnik, Alexey, Volovikova, Zoya, Côté, Marc-Alexandre, Voronov, Anton, Zholus, Artem, Arabzadeh, Negar, Mohanty, Shrestha, Teruel, Milagro, Awadallah, Ahmed, Panov, Aleksandr, Burtsev, Mikhail, Kiseleva, Julia

Nov-1-2022–arXiv.org Artificial Intelligence

The adoption of pre-trained language models to generate action plans for embodied agents is a promising research strategy. However, execution of instructions in real or simulated environments requires verification of the feasibility of actions as well as their relevance to the completion of a goal. We propose a new method that combines a language model and reinforcement learning for the task of building objects in a Minecraft-like environment according to the natural language instructions. Our method first generates a set of consistently achievable sub-goals from the instructions and then completes associated sub-tasks with a pre-trained RL policy. The proposed method formed the RL baseline at the IGLU 2022 competition.

large language model, machine learning, natural language, (13 more...)

arXiv.org Artificial Intelligence

Nov-1-2022

arXiv.org PDF

Add feedback

Genre:
- Research Report (0.50)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks (0.69)
  - Natural Language > Large Language Model (0.48)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found