AITopics | photogrammetry

Collaborating Authors

photogrammetry

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

EPIC Fields: Marrying 3D Geometry and Video Understanding

Neural Information Processing SystemsDec-25-2025, 07:31:06 GMT

Neural rendering is fuelling a unification of learning, 3D geometry and video understanding that has been waiting for more than two decades. Progress, however, is still hampered by a lack of suitable datasets and benchmarks. To address this gap, we introduce EPIC Fields, an augmentation of EPIC-KITCHENS with 3D camera information. Like other datasets for neural rendering, EPIC Fields removes the complex and expensive step of reconstructing cameras using photogrammetry, and allows researchers to focus on modelling problems. We illustrate the challenge of photogrammetry in egocentric videos of dynamic actions and propose innovations to address them. Compared to other neural rendering datasets, EPIC Fields is better tailored to video understanding because it is paired with labelled action segments and the recent VISOR segment annotations. To further motivate the community, we also evaluate two benchmark tasks in neural rendering and segmenting dynamic objects, with strong baselines that showcase what is not possible today. We also highlight the advantage of geometry in semi-supervised video object segmentations on the VISOR annotations.

electronic proceedings, epic field, name change, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Vision (1.00)

Add feedback

Evaluation of Flight Parameters in UAV-based 3D Reconstruction for Rooftop Infrastructure Assessment

Chodura, Nick, Greeff, Melissa, Woods, Joshua

arXiv.org Artificial IntelligenceNov-7-2025

Rooftop 3D reconstruction using UAV-based photogrammetry offers a promising solution for infrastructure assessment, but existing methods often require high percentages of image overlap and extended flight times to ensure model accuracy when using autonomous flight paths. This study systematically evaluates key flight parameters-ground sampling distance (GSD) and image overlap-to optimize the 3D reconstruction of complex rooftop infrastructure. Controlled UAV flights were conducted over a multi-segment rooftop at Queen's University using a DJI Phantom 4 Pro V2, with varied GSD and overlap settings. The collected data were processed using Reality Capture software and evaluated against ground truth models generated from UAV-based LiDAR and terrestrial laser scanning (TLS). Experimental results indicate that a GSD range of 0.75-1.26 cm combined with 85% image overlap achieves a high degree of model accuracy, while minimizing images collected and flight time. These findings provide guidance for planning autonomous UAV flight paths for efficient rooftop assessments.

accuracy, artificial intelligence, reconstruction, (14 more...)

arXiv.org Artificial Intelligence

doi: 10.22260/ISARC2025/0193

2504.02084

Country: North America > Canada > Ontario (0.14)

Genre: Research Report (1.00)

Industry:

Information Technology > Robotics & Automation (0.88)
Aerospace & Defense > Aircraft (0.68)

Technology:

Information Technology > Sensing and Signal Processing (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (1.00)

Add feedback

Design of a Bed Rotation Mechanism to Facilitate In-Situ Photogrammetric Reconstruction of Printed Parts

Roberts, Travis A., Karmakar, Sourabh, Turner, Cameron J.

arXiv.org Artificial IntelligenceOct-24-2025

Additive manufacturing, or 3D printing, is a complex process that creates free-form geometric objects by sequentially placing material to construct an object, usually in a layer-by-layer process. One of the most widely used methods is Fused Deposition Modeling (FDM). FDM is used in many of the consumer-grade polymer 3D printers available today. While consumer grade machines are cheap and plentiful, they lack many of the features desired in a machine used for research purposes and are often closed-source platforms. Commercial-grade models are more expensive and are also usually closed-source platforms that do not offer flexibility for modifications often needed for research. The authors designed and fabricated a machine to be used as a test bed for research in the field of polymer FDM processes. The goal was to create a platform that tightly controls and/or monitors the FDM build parameters so that experiments can be repeated with a known accuracy. The platform offers closed loop position feedback, control of the hot end and bed temperature, and monitoring of environment temperature and humidity. Additionally, the platform is equipped with cameras and a mechanism for in-situ photogrammetry, creating a geometric record of the printing throughout the printing process. Through photogrammetry, backtracking and linking process parameters to observable geometric defects can be achieved. This paper focuses on the design of a novel mechanism for spinning the heated bed to allow for photogrammetric reconstruction of the printed part using a minimal number of cameras, as implemented on this platform.

artificial intelligence, machine learning, printing process, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.1115/DETC2022-91106

2510.20079

Genre: Research Report (0.64)

Industry: Machinery > Industrial Machinery (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Sensing and Signal Processing > Image Processing (0.93)
Information Technology > Hardware (0.68)

Add feedback

L2M-Reg: Building-level Uncertainty-aware Registration of Outdoor LiDAR Point Clouds and Semantic 3D City Models

Xu, Ziyang, Schwab, Benedikt, Yang, Yihui, Kolbe, Thomas H., Holst, Christoph

arXiv.org Artificial IntelligenceSep-24-2025

Accurate registration between LiDAR (Light Detection and Ranging) point clouds and semantic 3D city models is a fundamental topic in urban digital twinning and a prerequisite for downstream tasks, such as digital construction, change detection and model refinement. However, achieving accurate LiDAR-to-Model registration at individual building level remains challenging, particularly due to the generalization uncertainty in semantic 3D city models at the Level of Detail 2 (LoD2). This paper addresses this gap by proposing L2M-Reg, a plane-based fine registration method that explicitly accounts for model uncertainty. L2M-Reg consists of three key steps: establishing reliable plane correspondence, building a pseudo-plane-constrained Gauss-Helmert model, and adaptively estimating vertical translation. Experiments on three real-world datasets demonstrate that L2M-Reg is both more accurate and computationally efficient than existing ICP-based and plane-based methods. Overall, L2M-Reg provides a novel building-level solution regarding LiDAR-to-Model registration when model uncertainty is present.

artificial intelligence, point cloud, registration, (17 more...)

arXiv.org Artificial Intelligence

2509.16832

Country: Europe > Germany > Bavaria > Upper Bavaria (0.15)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)

Add feedback

TUM2TWIN: Introducing the Large-Scale Multimodal Urban Digital Twin Benchmark Dataset

Wysocki, Olaf, Schwab, Benedikt, Biswanath, Manoj Kumar, Greza, Michael, Zhang, Qilin, Zhu, Jingwei, Froech, Thomas, Heeramaglore, Medhini, Hijazi, Ihab, Kanna, Khaoula, Pechinger, Mathias, Chen, Zhaiyu, Sun, Yao, Segura, Alejandro Rueda, Xu, Ziyang, AbdelGafar, Omar, Mehranfar, Mansour, Yeshwanth, Chandan, Liu, Yueh-Cheng, Yazdi, Hadi, Wang, Jiapan, Auer, Stefan, Anders, Katharina, Bogenberger, Klaus, Borrmann, Andre, Dai, Angela, Hoegner, Ludwig, Holst, Christoph, Kolbe, Thomas H., Ludwig, Ferdinand, Nießner, Matthias, Petzold, Frank, Zhu, Xiao Xiang, Jutzi, Boris

arXiv.org Artificial IntelligenceMay-14-2025

Urban Digital Twins (UDTs) have become essential for managing cities and integrating complex, heterogeneous data from diverse sources. Creating UDTs involves challenges at multiple process stages, including acquiring accurate 3D source data, reconstructing high-fidelity 3D models, maintaining models' updates, and ensuring seamless interoperability to downstream tasks. Current datasets are usually limited to one part of the processing chain, hampering comprehensive UDTs validation. To address these challenges, we introduce the first comprehensive multimodal Urban Digital Twin benchmark dataset: TUM2TWIN. This dataset includes georeferenced, semantically aligned 3D models and networks along with various terrestrial, mobile, aerial, and satellite observations boasting 32 data subsets over roughly 100,000 $m^2$ and currently 767 GB of data. By ensuring georeferenced indoor-outdoor acquisition, high accuracy, and multimodal data integration, the benchmark supports robust analysis of sensors and the development of advanced reconstruction methods. Additionally, we explore downstream tasks demonstrating the potential of TUM2TWIN, including novel view synthesis of NeRF and Gaussian Splatting, solar potential analysis, point cloud semantic segmentation, and LoD3 building reconstruction. We are convinced this contribution lays a foundation for overcoming current limitations in UDT creation, fostering new research directions and practical solutions for smarter, data-driven urban environments. The project is available under: https://tum2t.win

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2505.07396

Country:

North America > United States (0.28)
Europe > Germany > Bavaria (0.14)

Genre: Research Report (0.82)

Industry:

Transportation > Ground > Road (1.00)
Energy > Renewable (1.00)
Government (0.93)
Information Technology (0.93)

Technology:

Information Technology > Sensing and Signal Processing (1.00)
Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
(4 more...)

Add feedback

EPIC Fields: Marrying 3D Geometry and Video Understanding

Neural Information Processing SystemsJan-18-2025, 06:42:24 GMT

epic field, neural rendering, photogrammetry, (4 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Vision > Video Understanding (0.69)

Add feedback

Toward Appearance-based Autonomous Landing Site Identification for Multirotor Drones in Unstructured Environments

Springer, Joshua, Guðmundsson, Gylfi Þór, Kyas, Marcel

arXiv.org Artificial IntelligenceDec-19-2024

A remaining challenge in multirotor drone flight is the autonomous identification of viable landing sites in unstructured environments. One approach to solve this problem is to create lightweight, appearance-based terrain classifiers that can segment a drone's RGB images into safe and unsafe regions. However, such classifiers require data sets of images and masks that can be prohibitively expensive to create. We propose a pipeline to automatically generate synthetic data sets to train these classifiers, leveraging modern drones' ability to survey terrain automatically and the ability to automatically calculate landing safety masks from terrain models derived from such surveys. We then train a U-Net on the synthetic data set, test it on real-world data for validation, and demonstrate it on our drone platform in real-time.

artificial intelligence, classifier, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2412.15486

Country:

North America > United States > New York > New York County > New York City (0.04)
Europe > Iceland > Capital Region > Reykjavik (0.04)
Asia > China (0.04)

Genre: Research Report (0.50)

Industry: Information Technology (0.68)

Technology:

Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

ZAHA: Introducing the Level of Facade Generalization and the Large-Scale Point Cloud Facade Semantic Segmentation Benchmark Dataset

Wysocki, Olaf, Tan, Yue, Froech, Thomas, Xia, Yan, Wysocki, Magdalena, Hoegner, Ludwig, Cremers, Daniel, Holst, Christoph

arXiv.org Artificial IntelligenceDec-19-2024

Facade semantic segmentation is a long-standing challenge in photogrammetry and computer vision. Although the last decades have witnessed the influx of facade segmentation methods, there is a lack of comprehensive facade classes and data covering the architectural variability. In ZAHA, we introduce Level of Facade Generalization (LoFG), novel hierarchical facade classes designed based on international urban modeling standards, ensuring compatibility with real-world challenging classes and uniform methods' comparison. Realizing the LoFG, we present to date the largest semantic 3D facade segmentation dataset, providing 601 million annotated points at five and 15 classes of LoFG2 and LoFG3, respectively. Moreover, we analyze the performance of baseline semantic segmentation methods on our introduced LoFG classes and data, complementing it with a discussion on the unresolved challenges for facade segmentation. We firmly believe that ZAHA shall facilitate further development of 3D facade semantic segmentation methods, enabling robust segmentation indispensable in creating urban digital twins.

artificial intelligence, machine learning, segmentation, (18 more...)

arXiv.org Artificial Intelligence

2411.04865

Country:

North America > United States (0.14)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.05)
Europe > Slovenia > Drava > Municipality of Benedikt > Benedikt (0.04)
(8 more...)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Communications (0.93)

Add feedback

Satellite Sunroof: High-res Digital Surface Models and Roof Segmentation for Global Solar Mapping

Batchu, Vishal, Wilson, Alex, Peng, Betty, Elkin, Carl, Jain, Umangi, Van Arsdale, Christopher, Goroshin, Ross, Gulshan, Varun

arXiv.org Artificial IntelligenceAug-29-2024

The transition to renewable energy, particularly solar, is key to mitigating climate change. Google's Solar API aids this transition by estimating solar potential from aerial imagery, but its impact is constrained by geographical coverage. This paper proposes expanding the API's reach using satellite imagery, enabling global solar potential assessment. We tackle challenges involved in building a Digital Surface Model (DSM) and roof instance segmentation from lower resolution and single oblique views using deep learning models. Our models, trained on aligned satellite and aerial datasets, produce 25cm DSMs and roof segments. With ~1m DSM MAE on buildings, ~5deg roof pitch error and ~56% IOU on roof segmentation, they significantly enhance the Solar API's potential to promote solar adoption.

imagery, roof segment, satellite imagery, (13 more...)

arXiv.org Artificial Intelligence

2408.144

Country:

North America > United States (0.14)
North America > Canada > Ontario > Toronto (0.14)
Asia > Middle East > Jordan (0.04)
(12 more...)

Genre: Research Report (0.50)

Industry:

Energy > Renewable > Solar (0.46)
Energy > Renewable > Geothermal > Geothermal Energy Exploration and Development > Geophysical Analysis & Survey (0.38)

Technology:

Information Technology > Artificial Intelligence > Vision (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.55)

Add feedback

Voxel-Based Point Cloud Localization for Smart Spaces Management

Mortazavi, F. S., Shkedova, O., Feuerhake, U., Brenner, C., Sester, M.

arXiv.org Artificial IntelligenceJun-21-2024

This paper proposes a voxel-based approach for creating a digital twin of an urban environment that is capable of efficiently managing smart spaces. The paper explains the registration and localization procedure of the point cloud dataset, which uses the KISS ICP for scan point cloud combination and the RANSAC method for the initial alignment of the combined point cloud. The mobile mapping point cloud using Riegl VMX-250 serves as the reference map, and Velodyne scans are used for localization purposes. The point-to-plane iterative closest-point method is then employed to refine the alignment. The paper evaluates the efficacy of the proposed method by calculating the errors between the estimated and ground truth positions. The results indicate that the voxel-based approach is capable of accurately estimating the position of the sensor platform, which are applicable for various use cases. A specific use case in the context is smart parking space management, which is described and initial visualization results are shown.

point cloud, registration, velodyne scan, (11 more...)

arXiv.org Artificial Intelligence

doi: 10.5194/isprs-archives-XLVIII-1-W1-2023-325-2023

2406.1511

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
Europe > Germany > Lower Saxony > Hanover (0.14)
Europe > Italy (0.05)
(6 more...)

Genre: Research Report (1.00)

Industry: Energy (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Robots (0.68)
Information Technology > Artificial Intelligence > Vision (0.68)

Add feedback