AITopics | Geophysical Analysis & Survey

Collaborating Authors

Geophysical Analysis & Survey

Efficient Few-Shot Learning in Remote Sensing: Fusing Vision and Vision-Language Models

Chua, Jia Yun, Zolotas, Argyrios, Arana-Catania, Miguel

arXiv.org Artificial IntelligenceOct-17-2025

Remote sensing has become a vital tool across sectors such as urban planning, environmental monitoring, and disaster response. While the volume of data generated has increased significantly, traditional vision models are often constrained by the requirement for extensive domain-specific labelled data and their limited ability to understand the context within complex environments. Vision Language Models offer a complementary approach by integrating visual and textual data; however, their application to remote sensing remains underexplored, particularly given their generalist nature. This work investigates the combination of vision models and VLMs to enhance image analysis in remote sensing, with a focus on aircraft detection and scene understanding. The integration of YOLO with VLMs such as LLaVA, ChatGPT, and Gemini aims to achieve more accurate and contextually aware image interpretation. Performance is evaluated on both labelled and unlabelled remote sensing data, as well as degraded image scenarios which are crucial for remote sensing. The findings show an average MAE improvement of 48.46% across models in the accuracy of aircraft detection and counting, especially in challenging conditions, in both raw and degraded scenarios. A 6.17% improvement in CLIPScore for comprehensive understanding of remote sensing images is obtained. The proposed approach combining traditional vision models and VLMs paves the way for more advanced and efficient remote sensing image analysis, especially in few-shot learning scenarios.

large language model, machine learning, vlm, (22 more...)

arXiv.org Artificial Intelligence

2510.13993

Country: Asia > Japan (0.28)

Genre: Research Report > New Finding (1.00)

Industry: Energy > Renewable > Geothermal > Geothermal Energy Exploration and Development > Geophysical Analysis & Survey (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

UrbanFusion: Stochastic Multimodal Fusion for Contrastive Learning of Robust Spatial Representations

Mühlematter, Dominik J., Che, Lin, Hong, Ye, Raubal, Martin, Wiedemann, Nina

arXiv.org Artificial IntelligenceOct-16-2025

Forecasting urban phenomena such as housing prices and public health indicators requires the effective integration of various geospatial data. Current methods primarily utilize task-specific models, while recent foundation models for spatial representations often support only limited modalities and lack multimodal fusion capabilities. To overcome these challenges, we present UrbanFusion, a Geo-Foundation Model (GeoFM) that features Stochastic Multimodal Fusion (SMF). The framework employs modality-specific encoders to process different types of inputs, including street view imagery, remote sensing data, cartographic maps, and points of interest (POIs) data. These multimodal inputs are integrated via a Transformer-based fusion module that learns unified representations. An extensive evaluation across 41 tasks in 56 cities worldwide demonstrates UrbanFusion's strong generalization and predictive performance compared to state-of-the-art GeoAI models. Specifically, it 1) outperforms prior foundation models on location-encoding, 2) allows multimodal input during inference, and 3) generalizes well to regions unseen during training. UrbanFusion can flexibly utilize any subset of available modalities for a given location during both pretraining and inference, enabling broad applicability across diverse data availability scenarios. All source code is available at https://github.com/DominikM198/UrbanFusion.

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2510.13774

Country:

North America > United States (1.00)
Europe (1.00)
Asia (1.00)

Genre: Research Report > New Finding (0.92)

Industry:

Banking & Finance > Real Estate (0.66)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.47)
Transportation > Ground > Road (0.45)
Energy > Renewable > Geothermal > Geothermal Energy Exploration and Development > Geophysical Analysis & Survey (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.87)

Add feedback

Harnessing Self-Supervised Deep Learning and Geostationary Remote Sensing for Advancing Wildfire and Associated Air Quality Monitoring: Improved Smoke and Fire Front Masking using GOES and TEMPO Radiance Data

LaHaye, Nicholas, Munashinge, Thilanka, Lee, Hugo, Pan, Xiaohua, Abad, Gonzalo Gonzalez, Mahmoud, Hazem, Wei, Jennifer

arXiv.org Artificial IntelligenceOct-14-2025

This work demonstrates the possibilities for improving wildfire and air quality management in the western United States by leveraging the unprecedented hourly data from NASA's TEMPO satellite mission and advances in self-supervised deep learning. Here we demonstrate the efficacy of deep learning for mapping the near real-time hourly spread of wildfire fronts and smoke plumes using an innovative self-supervised deep learning-system: successfully distinguishing smoke plumes from clouds using GOES-18 and TEMPO data, strong agreement across the smoke and fire masks generated from different sensing modalities as well as significant improvement over operational products for the same cases.

artificial intelligence, deep learning, machine learning, (10 more...)

arXiv.org Artificial Intelligence

2510.09845

Country: North America > United States > California (0.28)

Genre: Research Report (0.50)

Industry:

Government > Regional Government > North America Government > United States Government (0.71)
Energy > Renewable > Geothermal > Geothermal Energy Exploration and Development > Geophysical Analysis & Survey (0.42)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

FUSU: A Multi-temporal-source Land Use Change Segmentation Dataset for Fine-grained Urban Semantic Understanding

Neural Information Processing SystemsOct-10-2025, 20:58:48 GMT

Fine urban change segmentation using multi-temporal remote sensing images is essential for understanding human-environment interactions in urban areas.

change detection, dataset, segmentation, (15 more...)

Neural Information Processing Systems

Country:

North America > United States (0.14)
Asia > China > Shaanxi Province > Xi'an (0.06)
Europe > Germany > Brandenburg > Potsdam (0.04)
(2 more...)

Genre: Research Report (0.46)

Industry:

Law > Real Estate Law (0.43)
Energy > Renewable > Geothermal > Geothermal Energy Exploration and Development > Geophysical Analysis & Survey (0.37)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

3D Semantic Understanding from Monocular Remote Sensing Imagery

Neural Information Processing SystemsOct-10-2025, 17:45:46 GMT

Section A.1 outlines the generation process of the SynRS3D dataset, including the tools and It also covers the licenses for these plugins. Section A.4 describes the experimental setup and the selection of hyperparameters for the RS3DAda method. Section A.5 presents the ablation study results and analysis for the RS3DAda method. Section A.6 provides supplementary experimental The generation workflow of SynRS3D involves several key steps, from initializing sensor and sunlight parameters to generating the layout, geometry, and textures of the scene. Initialization: Set up the sensor and sunlight parameters using uniform and normal distributions to simulate various conditions.

dataset, please provide, synrs3d, (13 more...)

Neural Information Processing Systems

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.15)
Europe > Germany > Brandenburg > Potsdam (0.04)
Asia > China > Shanghai > Shanghai (0.04)
(9 more...)

Genre:

Workflow (1.00)
Research Report > New Finding (0.46)

Industry:

Law (1.00)
Government (0.93)
Information Technology (0.69)
Energy > Renewable > Geothermal > Geothermal Energy Exploration and Development > Geophysical Analysis & Survey (0.53)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

Understanding from Monocular Remote Sensing Imagery

Neural Information Processing SystemsOct-10-2025, 17:45:43 GMT

This task combines land cover mapping, which is closely related to semantic segmentation in computer vision, and height estimation.

dataset, remote sensing, synrs3d, (9 more...)

Neural Information Processing Systems

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
Europe > Germany > Brandenburg > Potsdam (0.04)
Asia > Middle East > Jordan (0.04)
(13 more...)

Genre:

Workflow (0.93)
Research Report > Experimental Study (0.93)
Research Report > New Finding (0.67)

Industry:

Information Technology (1.00)
Energy > Renewable > Geothermal > Geothermal Energy Exploration and Development > Geophysical Analysis & Survey (0.54)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(3 more...)

Add feedback

M3LEO: A Multi-Modal, Multi-Label Earth Observation Dataset Integrating Interferometric SAR and Multispectral Data

Neural Information Processing SystemsOct-10-2025, 15:10:34 GMT

Satellite-based remote sensing has revolutionised the way we address global challenges in a rapidly evolving world. Huge quantities of Earth Observation (EO) data are generated by satellite sensors daily, but processing these large datasets for use in ML pipelines is technically and computationally challenging. Specifically, different types of EO data are often hosted on a variety of platforms, with differing degrees of availability for Python preprocessing tools. In addition, spatial alignment across data sources and data tiling for easier handling can present significant technical hurdles for novice users.

dataset, m3leo, remote sensing, (15 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
Asia > China > Guangdong Province (0.14)
(16 more...)

Industry:

Government (1.00)
Law (0.68)
Energy > Renewable > Geothermal > Geothermal Energy Exploration and Development > Geophysical Analysis & Survey (0.37)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Data Science (0.93)
Information Technology > Information Management (0.88)

Add feedback

Learning De-Biased Representations for Remote-Sensing Imagery

Neural Information Processing SystemsOct-10-2025, 05:07:11 GMT

It is an unsupervised learning approach that can diversify minor class features based on the shared attributes with major classes, where the attributes are obtained by a simple step of clustering.

dataset, foundation model, tail class, (16 more...)

Neural Information Processing Systems

Country:

Asia > Singapore (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)
Asia > China (0.04)
Africa (0.04)

Genre:

Research Report > Experimental Study (0.93)
Research Report > New Finding (0.68)

Industry: Energy > Renewable > Geothermal > Geothermal Energy Exploration and Development > Geophysical Analysis & Survey (0.53)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
(3 more...)

Add feedback

Supplementary Material for " AllClear: A Comprehensive Dataset and Benchmark for Cloud Removal in Satellite Imagery "

Neural Information Processing SystemsOct-10-2025, 04:11:20 GMT

In Sec. 2 we include a We include a datasheet for our dataset following the methodology from "Datasheets for Datasets" Ge-17 In this section, we include the prompts from Gebru et al. [2021] in blue, and in For what purpose was the dataset created? Was there a specific task in mind? The dataset was created to facilitate research development on cloud removal in satellite imagery. Specifically, our task is more temporally aligned than previous benchmarks. Who created the dataset (e.g., which team, research group) and on behalf of which entity (e.g., Who funded the creation of the dataset?

dataset, information, please provide, (16 more...)

Neural Information Processing Systems

Country: North America > United States (0.14)

Industry:

Law (1.00)
Government (0.68)
Energy > Renewable > Geothermal > Geothermal Energy Exploration and Development > Geophysical Analysis & Survey (0.61)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

AllClear: A Comprehensive Dataset and Benchmark for Cloud Removal in Satellite Imagery Hangyu Zhou

Neural Information Processing SystemsOct-10-2025, 04:11:17 GMT

Clouds in satellite imagery pose a significant challenge for downstream applications. A major challenge in current cloud removal research is the absence of a comprehensive benchmark and a sufficiently large and diverse training dataset.

allclear, cloud removal, dataset, (17 more...)

Neural Information Processing Systems

Country: