Skill Transformer: A Monolithic Policy for Mobile Manipulation

Huang, Xiaoyu, Batra, Dhruv, Rai, Akshara, Szot, Andrew

Aug-18-2023–arXiv.org Artificial Intelligence

Prior works [6, 33, 37] show that long-horizon robotic tasks by combining conditional sequence it is possible to end-to-end learn a single policy that can modeling and skill modularity. Conditioned on egocentric perform such diverse tasks and behaviors. Such a monolithic and proprioceptive observations of a robot, Skill policy directly maps observations to low-level actions, Transformer is trained end-to-end to predict both a highlevel reasoning both about which skill to execute and how to skill (e.g., navigation, picking, placing), and a wholebody execute it. However, scaling end-to-end learning to longhorizon, low-level action (e.g., base and arm motion), using multi-phase tasks with thousands of low-level steps a transformer architecture and demonstration trajectories and egocentric visual observations remains a challenging that solve the full task. It retains the composability and research question.

artificial intelligence, machine learning, skill transformer, (14 more...)

arXiv.org Artificial Intelligence

Aug-18-2023

arXiv.org PDF

Add feedback

Country:
- Europe > Switzerland > Zürich > Zürich (0.14)

Genre:
- Research Report > New Finding (0.48)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks
    - Deep Learning (0.67)
  - Robots (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found