GeoJEPA: Towards Eliminating Augmentation- and Sampling Bias in Multimodal Geospatial Learning

Feb-25-2025 – arXiv.org Artificial Intelligence

Existing methods for self-supervised representation learning of geospatial regions and map entities rely extensively on the design of pretext tasks, often involving augmentations or heuristic sampling of positive and negative pairs based on spatial proximity. This reliance introduces biases and limits the representations' expressiveness and generalisability. Consequently, the literature has expressed a pressing need to explore different methods for modelling geospatial data. To address the key difficulties of such methods, namely multimodality, heterogeneity, and the choice of pretext tasks, we present GeoJEPA, a versatile multimodal fusion model for geospatial data built on the self-supervised Joint-Embedding Predictive Architecture. With GeoJEPA, we aim to eliminate the widely accepted augmentation- and sampling biases found in self-supervised geospatial representation learning. GeoJEPA uses self-supervised pretraining on a large dataset of OpenStreetMap attributes, geometries and aerial images. The results are multimodal semantic representations of urban regions and map entities that we evaluate both quantitatively and qualitatively. Through this work, we uncover several key insights into JEPA's ability to handle multimodal data.

Country:
- North America
  - United States
    - Virginia (0.04)
    - South Carolina (0.04)
    - North Carolina (0.04)
    - Maryland (0.04)
    - Tennessee > Davidson County
      - Nashville (0.04)
    - Georgia > Fulton County
      - Atlanta (0.04)
    - California
      - San Francisco County > San Francisco (0.04)
      - Los Angeles County
        - Los Angeles (0.04)
        - Long Beach (0.04)
  - Canada
    - Quebec > Montreal (0.04)
    - British Columbia > Vancouver (0.04)
- Europe
  - Switzerland (0.04)
  - United Kingdom > England
    - Greater London > London (0.04)
  - Spain > Catalonia
    - Barcelona Province > Barcelona (0.04)
  - France > Île-de-France
    - Paris > Paris (0.04)
- Asia
  - Japan (0.04)
  - Singapore > Central Region
    - Singapore (0.04)
  - China > Beijing
    - Beijing (0.04)

Genre:
- Research Report (1.00)
- Overview (0.67)

Industry:
- Transportation
  - Infrastructure & Services (1.00)
  - Ground > Road (1.00)

Technology:
- Information Technology
  - Sensing and Signal Processing > Image Processing (1.00)
  - Data Science > Data Mining (1.00)
  - Artificial Intelligence
    - Representation & Reasoning > Spatial Reasoning (1.00)
    - Natural Language
      - Text Processing (1.00)
      - Large Language Model (1.00)
      - Information Retrieval (0.92)
    - Machine Learning > Neural Networks
      - Deep Learning (1.00)
