3D Object Detection and Viewpoint Estimation with a Deformable 3D Cuboid Model

Mar-14-2024, 11:03:08 GMT–Neural Information Processing Systems

This paper addresses the problem of category-level 3D object detection. Given a monocular image, our aim is to localize the objects in 3D by enclosing them with tight oriented 3D bounding boxes. We propose a novel approach that extends the well-acclaimed deformable part-based model [1] to reason in 3D. Our model represents an object class as a deformable 3D cuboid composed of faces and parts, which are both allowed to deform with respect to their anchors on the 3D box. We model the appearance of each face in fronto-parallel coordinates, thus effectively factoring out the appearance variation induced by viewpoint.

cuboid, deformation, detection, (16 more...)

Neural Information Processing Systems

Mar-14-2024, 11:03:08 GMT

Conferences PDF

Add feedback

Country:
- North America
  - United States > Illinois
    - Cook County > Chicago (0.04)
  - Canada > Ontario
    - Toronto (0.14)

Genre:
- Research Report (0.34)

Industry:
- Government (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Vision (1.00)
  - Representation & Reasoning > Object-Oriented Architecture (0.68)
  - Machine Learning
    - Inductive Learning (0.70)
    - Statistical Learning (0.68)
    - Supervised Learning (0.48)