Voxel-based 3D Detection and Reconstruction of Multiple Objects from a Single Image

Oct-9-2024, 14:03:04 GMT–Neural Information Processing Systems

Inferring 3D locations and shapes of multiple objects from a single 2D image is a long-standing objective of computer vision. Most of the existing works either predict one of these 3D properties or focus on solving both for a single object. One fundamental challenge lies in how to learn an effective representation of the image that is well-suited for 3D detection and reconstruction. In this work, we propose to learn a regular grid of 3D voxel features from the input image which is aligned with 3D scene space via a 3D feature lifting operator. Based on the 3D voxel features, our novel CenterNet-3D detection head formulates the 3D detection as keypoint detection in the 3D space.

detection and reconstruction, multiple object, voxel feature, (2 more...)

Neural Information Processing Systems

Oct-9-2024, 14:03:04 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Vision (0.43)