modelnet10-so
- North America > United States > California > Santa Clara County > Palo Alto (0.04)
- Europe (0.04)
- Asia > South Korea > Gyeongsangbuk-do > Pohang (0.04)
- (2 more...)
- Health & Medicine (0.67)
- Information Technology > Security & Privacy (0.46)
ProbabilisticOrientationEstimationwithMatrix FisherDistributions
This paper focuses on estimating probability distributions over the set of 3D rotations (SO(3)) using deep neural networks. Learning to regress models to the set of rotations is inherently difficult due to differences in topology between RN and SO(3). We overcome this issue by using a neural network to output the parameters for a matrix Fisher distribution since these parameters are homeomorphic toR9. By using a negative log likelihood loss for this distribution we get a loss which is convex with respect to the network outputs. By optimizing this loss we improve state-of-the-art on several challenging applicable datasets, namely Pascal3D+, ModelNet10-SO(3).
- Europe > Sweden > Stockholm > Stockholm (0.05)
- North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
- Europe > Latvia > Riga Municipality > Riga (0.04)
- North America > United States > California > Santa Clara County > Palo Alto (0.04)
- Europe (0.04)
- Asia > South Korea > Gyeongsangbuk-do > Pohang (0.04)
- (2 more...)
- Health & Medicine (0.67)
- Information Technology > Security & Privacy (0.46)
- Europe > Sweden > Stockholm > Stockholm (0.04)
- North America > Canada (0.04)
- Europe > Latvia > Riga Municipality > Riga (0.04)
- Asia > Middle East > Republic of Türkiye > Karaman Province > Karaman (0.04)
3D Equivariant Pose Regression via Direct Wigner-D Harmonics Prediction
Determining the 3D orientations of an object in an image, known as single-image pose estimation, is a crucial task in 3D vision applications. Existing methods typically learn 3D rotations parametrized in the spatial domain using Euler angles or quaternions, but these representations often introduce discontinuities and singularities. SO(3)-equivariant networks enable the structured capture of pose patterns with data-efficient learning, but the parametrizations in spatial domain are incompatible with their architecture, particularly spherical CNNs, which operate in the frequency domain to enhance computational efficiency. To overcome these issues, we propose a frequency-domain approach that directly predicts Wigner-D coefficients for 3D rotation regression, aligning with the operations of spherical CNNs. Our SO(3)-equivariant pose harmonics predictor overcomes the limitations of spatial parameterizations, ensuring consistent pose estimation under arbitrary rotations. Trained with a frequency-domain regression loss, our method achieves state-of-the-art results on benchmarks such as ModelNet10-SO(3) and PASCAL3D+, with significant improvements in accuracy, robustness, and data efficiency.
- North America > United States > California > Santa Clara County > Palo Alto (0.04)
- Europe (0.04)
- Asia > South Korea > Gyeongsangbuk-do > Pohang (0.04)
- (2 more...)