DialFRED: Dialogue-Enabled Agents for Embodied Instruction Following

Gao, Xiaofeng, Gao, Qiaozi, Gong, Ran, Lin, Kaixiang, Thattai, Govind, Sukhatme, Gaurav S.

Aug-15-2022–arXiv.org Artificial Intelligence

Language-guided Embodied AI benchmarks requiring an agent to navigate an environment and manipulate objects typically allow one-way communication: the human user gives a natural language command to the agent, and the agent can only follow the command passively. We present DialFRED, a dialogue-enabled embodied instruction following benchmark based on the ALFRED benchmark. DialFRED allows an agent to actively ask questions to the human user; the additional information in the user's response is used by the agent to better complete its task. We release a human-annotated dataset with 53K task-relevant questions and answers and an oracle to answer questions. To solve DialFRED, we propose a questioner-performer framework wherein the questioner is pre-trained with the human-annotated data and fine-tuned with reinforcement learning. We make DialFRED publicly available and encourage researchers to propose and evaluate their solutions to building dialog-enabled embodied agents.

agent, instruction, questioner, (16 more...)

arXiv.org Artificial Intelligence

Aug-15-2022

arXiv.org PDF

Add feedback

Country:
- North America
  - United States (0.04)
  - Canada > British Columbia
    - Metro Vancouver Regional District > Vancouver (0.04)
- Europe > Germany
  - Berlin (0.04)

Genre:
- Workflow (0.69)
- Research Report (0.64)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Reinforcement Learning (0.67)
  - Representation & Reasoning > Agents (0.67)
  - Natural Language > Discourse & Dialogue (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found