Leveraging SE(3) Equivariance for Self-Supervised Category-Level Object Pose Estimation

Neural Information Processing Systems 

The key to our method is to disentangle shape and pose through an invariant shape reconstruction module and an equivariant pose estimation module, empowered by SE(3) equivariant point cloud networks. The invariant shape reconstruction module learns to perform aligned reconstructions, yielding a category-level reference frame without using any annotations.