UniGraspTransformer: Simplified Policy Distillation for Scalable Dexterous Robotic Grasping

Wang, Wenbo, Wei, Fangyun, Zhou, Lei, Chen, Xi, Luo, Lin, Yi, Xiaohan, Zhang, Yizhong, Liang, Yaobo, Xu, Chang, Lu, Yan, Yang, Jiaolong, Guo, Baining

Dec-3-2024, arXiv.org Artificial Intelligence

We introduce UniGraspTransformer, a universal Transformer-based network for dexterous robotic grasping that simplifies training while enhancing scalability and performance. Unlike prior methods such as UniDexGrasp++, which require complex, multi-step training pipelines, UniGraspTransformer follows a streamlined process: first, dedicated policy networks are trained for individual objects using reinforcement learning to generate successful grasp trajectories; then, these trajectories are distilled into a single, universal network. Our approach enables UniGraspTransformer to scale effectively, incorporating up to 12 self-attention blocks for handling thousands of objects with diverse poses. Additionally, it generalizes well to both idealized and real-world inputs, evaluated in state-based and vision-based settings. Notably, UniGraspTransformer generates a broader range of grasping poses for objects of various shapes and orientations, resulting in more diverse grasp strategies. Experimental results demonstrate significant improvements over the state-of-the-art UniDexGrasp++ across various object categories, achieving success rate gains of 3.5%, 7.7%, and 10.1% on seen objects, unseen objects within seen categories, and completely unseen objects, respectively, in the vision-based setting. Project page: https://dexhand.github.io/UniGraspTransformer.
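The two-stage process described in the abstract (per-object policies first, then distillation of their trajectories into one shared network) can be illustrated with a deliberately tiny sketch. Everything here is an assumption made for illustration: the per-object "teachers" are hand-written linear policies standing in for RL-trained networks, the object is reduced to a single scalar feature, and the "student" is a two-parameter model fitted by plain supervised regression rather than a Transformer. The names `make_teacher`, `collect_trajectories`, `distill`, and `student_action` are hypothetical and do not come from the paper or its code.

```python
import random


def make_teacher(gain):
    """Hypothetical stand-in for an RL-trained per-object policy: action = gain * state."""
    def policy(state):
        return gain * state
    return policy


def collect_trajectories(teachers, steps=50, seed=0):
    """Stage 1 output: roll out each teacher and pool (object_feature, state, action) samples."""
    rng = random.Random(seed)
    data = []
    for obj_feature, policy in teachers.items():
        for _ in range(steps):
            state = rng.uniform(-1.0, 1.0)
            data.append((obj_feature, state, policy(state)))
    return data


def student_action(w0, w1, obj_feature, state):
    """Single shared student, conditioned on the object feature."""
    return (w0 + w1 * obj_feature) * state


def distill(data, epochs=1000, lr=0.5):
    """Stage 2: fit the shared student to the pooled teacher data by MSE gradient descent."""
    w0, w1 = 0.0, 0.0
    n = len(data)
    for _ in range(epochs):
        g0 = g1 = 0.0
        for f, s, a in data:
            err = student_action(w0, w1, f, s) - a
            g0 += 2.0 * err * s / n        # d(MSE)/d(w0)
            g1 += 2.0 * err * s * f / n    # d(MSE)/d(w1)
        w0 -= lr * g0
        w1 -= lr * g1
    return w0, w1


# Three toy "objects" whose teacher gain happens to be 1 + feature (an assumption of this sketch).
teachers = {0.0: make_teacher(1.0), 0.5: make_teacher(1.5), 1.0: make_teacher(2.0)}
data = collect_trajectories(teachers)
w0, w1 = distill(data)
```

In this toy setup the student can then be queried for an object feature that no teacher covered, for example `student_action(w0, w1, 0.75, 0.4)`, which loosely mirrors the seen/unseen evaluation split mentioned in the abstract. The sketch only conveys the shape of the pipeline; it says nothing about the actual network architecture, observation inputs, or training losses used by the authors.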

machine learning, object-oriented architecture, reinforcement learning

Country:
- Asia
  - Singapore (0.04)
  - China > Shandong Province
    - Dongying (0.04)

Genre:
- Research Report > New Finding (0.48)

Industry:
- Education (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Robots > Manipulation (0.68)
  - Representation & Reasoning > Object-Oriented Architecture (0.48)
  - Machine Learning
    - Reinforcement Learning (0.67)
    - Neural Networks > Deep Learning (0.48)