AITopics | structure-from-motion

Collaborating Authors

structure-from-motion

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

CuSfM: CUDA-Accelerated Structure-from-Motion

Yu, Jingrui, Liu, Jun, Ren, Kefei, Biswas, Joydeep, Ye, Rurui, Wu, Keqiang, Majithia, Chirag, Zeng, Di

arXiv.org Artificial IntelligenceOct-20-2025

Efficient and accurate camera pose estimation forms the foundational requirement for dense reconstruction in autonomous navigation, robotic perception, and virtual simulation systems. This paper addresses the challenge via cuSfM, a CUDA-accelerated offline Structure-from-Motion system that leverages GPU parallelization to efficiently employ computationally intensive yet highly accurate feature extractors, generating comprehensive and non-redundant data associations for precise camera pose estimation and globally consistent mapping. The system supports pose optimization, mapping, prior-map localization, and extrinsic refinement. It is designed for offline processing, where computational resources can be fully utilized to maximize accuracy. Experimental results demonstrate that cuSfM achieves significantly improved accuracy and processing speed compared to the widely used COLMAP method across various testing scenarios, while maintaining the high precision and global consistency essential for offline SfM applications. The system is released as an open-source Python wrapper implementation, PyCuSfM, available at https://github.com/nvidia-isaac/pyCuSFM, to facilitate research and applications in computer vision and robotics.

artificial intelligence, cusfm, machine learning, (13 more...)

arXiv.org Artificial Intelligence

2510.15271

Genre: Research Report (0.70)

Industry: Information Technology (0.35)

Technology:

Information Technology > Artificial Intelligence > Vision (0.77)
Information Technology > Artificial Intelligence > Machine Learning (0.69)
Information Technology > Artificial Intelligence > Robots (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)

Add feedback

MP-SfM: Monocular Surface Priors for Robust Structure-from-Motion

Pataki, Zador, Sarlin, Paul-Edouard, Schönberger, Johannes L., Pollefeys, Marc

arXiv.org Artificial IntelligenceApr-29-2025

While Structure-from-Motion (SfM) has seen much progress over the years, state-of-the-art systems are prone to failure when facing extreme viewpoint changes in low-overlap, low-parallax or high-symmetry scenarios. Because capturing images that avoid these pitfalls is challenging, this severely limits the wider use of SfM, especially by non-expert users. We overcome these limitations by augmenting the classical SfM paradigm with monocular depth and normal priors inferred by deep neural networks. Thanks to a tight integration of monocular and multi-view constraints, our approach significantly outperforms existing ones under extreme viewpoint changes, while maintaining strong performance in standard conditions. We also show that monocular priors can help reject faulty associations due to symmetries, which is a long-standing problem for SfM. This makes our approach the first capable of reliably reconstructing challenging indoor environments from few images. Through principled uncertainty propagation, it is robust to errors in the priors, can handle priors inferred by different models with little tuning, and will thus easily benefit from future progress in monocular depth and normal estimation. Our code is publicly available at https://github.com/cvg/mpsfm.

artificial intelligence, machine learning, reconstruction, (18 more...)

arXiv.org Artificial Intelligence

2504.2004

Genre: Research Report (0.50)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.54)

Add feedback

Learning Structure-from-Motion with Graph Attention Networks

Brynte, Lucas, Iglesias, José Pedro, Olsson, Carl, Kahl, Fredrik

arXiv.org Artificial IntelligenceDec-4-2023

In this paper we tackle the problem of learning Structure-from-Motion (SfM) through the use of graph attention networks. SfM is a classic computer vision problem that is solved though iterative minimization of reprojection errors, referred to as Bundle Adjustment (BA), starting from a good initialization. In order to obtain a good enough initialization to BA, conventional methods rely on a sequence of sub-problems (such as pairwise pose estimation, pose averaging or triangulation) which provides an initial solution that can then be refined using BA. In this work we replace these sub-problems by learning a model that takes as input the 2D keypoints detected across multiple views, and outputs the corresponding camera poses and 3D keypoint coordinates. Our model takes advantage of graph neural networks to learn SfM-specific primitives, and we show that it can be used for fast inference of the reconstruction for new and unseen sequences. The experimental results show that the proposed model outperforms competing learning-based methods, and challenges COLMAP while having lower runtime.

data augmentation, dpesfm, reconstruction, (15 more...)

arXiv.org Artificial Intelligence

2308.15984

Country:

Europe > Sweden > Vaestra Goetaland > Gothenburg (0.14)
North America > Canada > Ontario > Toronto (0.14)
Asia > Singapore (0.08)
(8 more...)

Genre: Research Report > New Finding (0.48)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Distributed Global Structure-from-Motion with a Deep Front-End

Baid, Ayush, Lambert, John, Driver, Travis, Krishnan, Akshay, Stepanyan, Hayk, Dellaert, Frank

arXiv.org Artificial IntelligenceNov-30-2023

While initial approaches to Structure-from-Motion (SfM) revolved around both global and incremental methods, most recent applications rely on incremental systems to estimate camera poses due to their superior robustness. Though there has been tremendous progress in SfM `front-ends' powered by deep models learned from data, the state-of-the-art (incremental) SfM pipelines still rely on classical SIFT features, developed in 2004. In this work, we investigate whether leveraging the developments in feature extraction and matching helps global SfM perform on par with the SOTA incremental SfM approach (COLMAP). To do so, we design a modular SfM framework that allows us to easily combine developments in different stages of the SfM pipeline. Our experiments show that while developments in deep-learning based two-view correspondence estimation do translate to improvements in point density for scenes reconstructed with global SfM, none of them outperform SIFT when comparing with incremental SfM results on a range of datasets. Our SfM system is designed from the ground up to leverage distributed computation, enabling us to parallelize computation on multiple machines and scale to large scenes.

bundle adjustment, dataset, image pair, (15 more...)

arXiv.org Artificial Intelligence

2311.18801

Country:

Europe > Sweden > Vaestra Goetaland > Gothenburg (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > California > San Francisco County > San Francisco (0.04)
(2 more...)

Genre: Research Report (0.64)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.54)

Add feedback

Creating Neural Search and Rescue Fly-Through Environments with Mega-NeRF

#artificialintelligenceDec-21-2021, 13:51:06 GMT

A new research collaboration between Carnegie Mellon and autonomous driving technology company Argo AI has developed an economical method for generating dynamic fly-through environments based on Neural Radiance Fields (NeRF), using footage captured by drones. Mega-NeRF offers interactive fly-bys based on drone footage, with on-demand LOD. For more detail (at better resolution), check out the video embedded at the end of this article. The new approach, called Mega-NeRF, obtains a 40x speed-up compared to the average Neural Radiance Fields rendering standard, as well as offering something notably different from the standard tanks and temples that recur in new NeRF papers. The new paper is titled Mega-NeRF: Scalable Construction of Large-Scale NeRFs for Virtual Fly-Throughs, and comes from three researchers at Carnegie Mellon, one of whom also represents Argo AI.

dataset, footage, mega-nerf, (12 more...)

#artificialintelligence

Country:

North America > United States > Indiana (0.05)
Asia > China > Guangdong Province > Shenzhen (0.05)

Industry: Information Technology (0.75)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (0.36)

Add feedback

openMVG/awesome_3DReconstruction_list

@machinelearnbotJan-31-2018, 01:41:54 GMT

Randomized Structure from Motion Based on Atomic 3D Models from Camera Triplets.

artificial intelligence, reconstruction, structure-from-motion, (15 more...)

@machinelearnbot

Technology: Information Technology > Artificial Intelligence > Vision (0.97)

Add feedback