Bidirectional Decoding: Improving Action Chunking via Closed-Loop Resampling

Liu, Yuejiang, Hamid, Jubayer Ibn, Xie, Annie, Lee, Yoonho, Du, Maximilian, Finn, Chelsea

Aug-30-2024–arXiv.org Artificial Intelligence

The increasing availability of human demonstrations has spurred renewed interest in behavioral cloning [1, 2]. In particular, recent studies have highlighted the potential of learning from large-scale demonstrations to acquire a variety of complex skills [3, 4, 5, 6, 7, 8]. However, this approach still struggles with two common properties of human demonstrations: (i) strong temporal dependencies across multiple steps, such as idle pauses [4] and latent strategies [9, 10], (ii) large style variability across different demonstrations, including differences in proficiency [11] and preference [12]. Oftentimes, both properties are prevalent yet unlabeled in collected data, posing significant challenges to traditional behavioral cloning, which typically learns a discriminative model to map an input state to a target action. In response to these challenges, recent works have pursued a generative approach characterized by two key elements: (i) predicting a sequence of actions over multiple time steps and executing all or part of the sequence, known as action chunking [3] or receding horizon [4]; (ii) modeling the distribution of action chunks and sampling from the learned model in an independent [4, 13] or weakly dependent [3, 14] manner during deployment. Some studies find these elements crucial for learning a performant policy in controlled laboratory scenarios [3, 4], while other recent work reports opposite outcomes under practical conditions [6]. The reasons behind these conflicting results remain unclear.

action chunk, demonstration, international conference, (14 more...)

arXiv.org Artificial Intelligence

Aug-30-2024

arXiv.org PDF

Add feedback

Country:
- North America
  - United States > Massachusetts
    - Suffolk County > Boston (0.04)
  - Canada > Ontario
    - Toronto (0.04)
- Asia
  - Middle East > UAE
    - Abu Dhabi Emirate > Abu Dhabi (0.04)
  - Japan > Honshū
    - Chūbu > Ishikawa Prefecture > Kanazawa (0.04)

Genre:
- Research Report (1.00)

Industry:
- Energy > Renewable > Geothermal > Geothermal Energy Systems and Facilities > Geothermal System for Power Generation > Advanced Geothermal System (AGS) (0.42)

Technology:
- Information Technology > Artificial Intelligence
  - Robots (1.00)
  - Natural Language (1.00)
  - Machine Learning (1.00)
  - Representation & Reasoning > Agents (0.45)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found