Neural Descriptor Fields: SE(3)-Equivariant Object Representations for Manipulation

Simeonov, Anthony, Du, Yilun, Tagliasacchi, Andrea, Tenenbaum, Joshua B., Rodriguez, Alberto, Agrawal, Pulkit, Sitzmann, Vincent

Dec-9-2021–arXiv.org Artificial Intelligence

We present Neural Descriptor Fields (NDFs), an object representation that encodes both points and relative poses between an object and a target (such as a robot gripper or a rack used for hanging) via category-level descriptors. We employ this representation for object manipulation, where given a task demonstration, we want to repeat the same task on a new object instance from the same category. We propose to achieve this objective by searching (via optimization) for the pose whose descriptor matches that observed in the demonstration. NDFs are conveniently trained in a self-supervised fashion via a 3D auto-encoding task that does not rely on expert-labeled keypoints. Further, NDFs are SE(3)-equivariant, guaranteeing performance that generalizes across all possible 3D object translations and rotations. We demonstrate learning of manipulation tasks from few (5-10) demonstrations both in simulation and on a real robot. Our performance generalizes across both object instances and 6-DoF object poses, and significantly outperforms a recent baseline that relies on 2D descriptors. Project website: https://yilundu.github.io/ndf/.

artificial intelligence, demonstration, machine learning, (17 more...)

arXiv.org Artificial Intelligence

Dec-9-2021

arXiv.org PDF

Add feedback

Country:
- Asia > Singapore (0.04)
- Europe > Norway
  - North Sea > Central North Sea (0.04)
- North America
  - Canada > Ontario
    - Toronto (0.14)
  - United States
    - Massachusetts (0.04)
    - Oklahoma > Beaver County (0.04)

Genre:
- Research Report (0.64)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks (0.46)
  - Robots (1.00)
  - Vision (1.00)