Single-View 3D Reconstruction via SO(2)-Equivariant Gaussian Sculpting Networks
Xu, Ruihan, Opipari, Anthony, Mah, Joshua, Lewis, Stanley, Zhang, Haoran, Guo, Hanzhe, Jenkins, Odest Chadwicke
–arXiv.org Artificial Intelligence
This paper introduces SO(2)-Equivariant Gaussian Sculpting Networks (GSNs) as an approach for SO(2)-Equivariant 3D object reconstruction from single-view image observations. GSNs take a single observation as input to generate a Gaussian splat representation describing the observed object's geometry and texture. By using a shared feature extractor before decoding Gaussian colors, covariances, positions, and opacities, GSNs achieve extremely high throughput (>150FPS). Experiments demonstrate that GSNs can be trained efficiently using a multi-view rendering loss and are competitive, in quality, with expensive diffusion-based reconstruction algorithms. The GSN model is validated on multiple benchmark experiments. Moreover, we demonstrate the potential for GSNs to be used within a robotic manipulation pipeline for object-centric grasping.
arXiv.org Artificial Intelligence
Sep-11-2024
- Country:
- North America > United States
- Massachusetts (0.04)
- Michigan > Washtenaw County
- Ann Arbor (0.14)
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- Asia > Japan
- Honshū > Chūbu > Ishikawa Prefecture > Kanazawa (0.04)
- North America > United States
- Genre:
- Research Report > New Finding (0.46)
- Technology:
- Information Technology > Artificial Intelligence
- Vision (1.00)
- Robots (1.00)
- Machine Learning (1.00)
- Information Technology > Artificial Intelligence