3DCoMPaT200: Language Grounded Large-Scale 3D Vision Dataset for Compositional Recognition
–Neural Information Processing Systems
Understanding objects in 3D at the part level is essential for humans and robots to navigate and interact with the environment. Current datasets for part-level 3D object understanding encompass a limited range of categories. For instance, the ShapeNet-Part and PartNet datasets only include 16, and 24 object categories respectively. The 3DCoMPaT dataset, specifically designed for compositional understanding of parts and materials, contains only 42 object categories. To foster richer and fine-grained part-level 3D understanding, we introduce 3DCoMPaT200, a large-scale dataset tailored for compositional understanding of object parts and materials, with 200 object categories with approximately 5 times larger object vocabulary compared to 3DCoMPaT and almost 4 times larger part categories.
Neural Information Processing Systems
May-27-2025, 21:19:17 GMT
- Technology:
- Information Technology > Artificial Intelligence > Robots (0.61)