E2CL: Exploration-based Error Correction Learning for Embodied Agents

Wang, Hanlin, Leong, Chak Tou, Wang, Jian, Li, Wenjie

Sep-5-2024–arXiv.org Artificial Intelligence

Language models are exhibiting increasing capability in knowledge utilization and reasoning. However, when applied as agents in embodied environments, they often suffer from misalignment between their intrinsic knowledge and environmental knowledge, leading to infeasible actions. Traditional environment alignment methods, such as supervised learning on expert trajectories and reinforcement learning, face limitations in covering environmental knowledge and achieving efficient convergence, respectively. Inspired by human learning, we propose Exploration-based Error Correction Learning (E2CL), a novel framework that leverages exploration-induced errors and environmental feedback to enhance environment alignment for LM-based agents. E2CL incorporates teacher-guided and teacher-free exploration to gather environmental feedback and correct erroneous actions. The agent learns to provide feedback and self-correct, thereby enhancing its adaptability to target environments. Evaluations in the VirtualHome environment demonstrate that E2CL-trained agents outperform those trained by baseline methods and exhibit superior self-correction capabilities.
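
To make the data-collection idea in the abstract concrete, here is a minimal sketch of one possible reading of it, not the authors' implementation. Everything in it is an assumption made for illustration: the toy precondition table, the `naive_policy` agent, the `CorrectionExample` record, and in particular the choice to take corrections from an expert trajectory in both modes. The abstract does not specify any of these details.

```python
from dataclasses import dataclass

# Hypothetical toy environment: an action is feasible only once its
# prerequisite has been executed. None of these names come from the paper.
PRECONDITIONS = {
    "walk_to_fridge": None,
    "open_fridge": "walk_to_fridge",
    "grab_milk": "open_fridge",
}
EXPERT = ["walk_to_fridge", "open_fridge", "grab_milk"]


@dataclass
class CorrectionExample:
    history: tuple     # actions executed before the error
    wrong_action: str  # infeasible action proposed by the agent
    feedback: str      # message returned by the environment
    correction: str    # action taken from the expert trajectory


def env_feedback(history, action):
    """Return None if the action is feasible, else a textual error message."""
    needed = PRECONDITIONS[action]
    if needed is not None and needed not in history:
        return f"cannot {action}: {needed} has not been done"
    if action in history:
        return f"cannot {action}: already done"
    return None


def naive_policy(history):
    """A deliberately flawed agent that tries to grab the milk too early."""
    for action in ["walk_to_fridge", "grab_milk", "open_fridge"]:
        if action not in history:
            return action
    return None  # nothing left to propose


def teacher_guided(policy):
    """Query the agent at every prefix of the expert trajectory."""
    examples = []
    for t, expert_action in enumerate(EXPERT):
        history = tuple(EXPERT[:t])
        proposed = policy(history)
        feedback = env_feedback(history, proposed)
        if feedback is not None:
            examples.append(
                CorrectionExample(history, proposed, feedback, expert_action)
            )
    return examples


def teacher_free(policy, max_steps=10):
    """Let the agent act on its own history; repair each error and continue."""
    examples, history = [], ()
    for _ in range(max_steps):
        proposed = policy(history)
        if proposed is None:
            break
        feedback = env_feedback(history, proposed)
        if feedback is None:
            history += (proposed,)
            continue
        # Assumption: the correction is the first expert action not yet executed.
        correction = next(a for a in EXPERT if a not in history)
        examples.append(CorrectionExample(history, proposed, feedback, correction))
        history += (correction,)
    return examples
```

Under these assumptions, each collected record pairs an erroneous action with the environment's feedback and a corrected action, which is the kind of data a model could then be fine-tuned on to produce feedback and self-corrections.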

agent, language model, speculative inference

Country:
- Asia > China
  - Hong Kong (0.04)
  - Guangxi Province > Nanning (0.04)

Genre:
- Research Report (0.82)

Technology:
- Information Technology
  - Data Science > Data Quality
    - Data Cleaning (0.61)
  - Artificial Intelligence
    - Natural Language > Large Language Model (0.70)
    - Representation & Reasoning > Agents (0.52)
    - Machine Learning > Learning Graphical Models
      - Undirected Networks > Markov Models (0.46)
