GPS-MTM: Capturing Pattern of Normalcy in GPS-Trajectories with self-supervised learning

Umang Garg, Bowen Zhang, Anantajit Subrahmanya, Chandrakanth Gudavalli, B. S. Manjunath

Oct-9-2025–arXiv.org Artificial Intelligence

Foundation models have driven remarkable progress in text, vision, and video understanding, and are now poised to unlock similar breakthroughs in trajectory modeling. We introduce the GPS Masked Trajectory Transformer (GPS-MTM), a foundation model for large-scale mobility data that captures patterns of normalcy in human movement. Unlike prior approaches that flatten trajectories into coordinate streams, GPS-MTM decomposes mobility into two complementary modalities: states (point-of-interest categories) and actions (agent transitions). Using a bidirectional Transformer trained with a self-supervised masked modeling objective, the model reconstructs missing segments across both modalities, which lets it learn rich semantic correlations without manual labels. Across benchmark datasets, including Numosim-LA, Urban Anomalies, and Geolife, GPS-MTM consistently outperforms prior approaches on downstream tasks such as trajectory infilling and next-stop prediction. Its advantages are most pronounced in dynamics tasks (inverse and forward dynamics), where contextual reasoning is critical. These results establish GPS-MTM as a robust foundation model for trajectory analytics and position mobility data as a first-class modality for large-scale representation learning. The code is released for reference.
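
To make the masked modeling objective concrete, the following is a minimal sketch of the data-preparation step only, not the authors' implementation. It assumes each trajectory has already been tokenized into a list of state tokens and a list of action tokens; the `<mask>` token, the `mask_trajectory` function, the masking ratio, and the example labels are all hypothetical. A model such as the one described in the abstract would then be trained to predict the hidden tokens from the visible context in both modalities.

```python
import random

MASK = "<mask>"  # hypothetical placeholder token, not taken from the paper


def mask_trajectory(states, actions, mask_ratio=0.3, seed=0):
    """Hide a random subset of tokens in each modality.

    Returns ((masked_states, state_targets), (masked_actions, action_targets)),
    where each targets dict maps a masked position to its original token.
    The masking scheme here is an illustrative assumption.
    """
    if not states or not actions:
        raise ValueError("both modalities must be non-empty")
    rng = random.Random(seed)

    def mask_one(tokens):
        # Mask at least one position so there is always something to reconstruct.
        n_mask = max(1, round(mask_ratio * len(tokens)))
        positions = set(rng.sample(range(len(tokens)), n_mask))
        masked = [MASK if i in positions else tok for i, tok in enumerate(tokens)]
        targets = {i: tokens[i] for i in sorted(positions)}
        return masked, targets

    return mask_one(states), mask_one(actions)


# Hypothetical example: POI categories as states, transitions as actions.
states = ["home", "cafe", "office", "gym", "home"]
actions = ["walk", "drive", "walk", "drive", "stay"]
(masked_states, state_targets), (masked_actions, action_targets) = mask_trajectory(
    states, actions, mask_ratio=0.4, seed=1
)
print(masked_states, state_targets)
print(masked_actions, action_targets)
```

Under this framing, masking a contiguous span in the middle of a trajectory resembles trajectory infilling, and masking only the final state resembles next-stop prediction.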

large language model, machine learning, natural language

Country:
- Africa > Zambia (0.04)
- Asia > China
  - Beijing > Beijing (0.04)
- Europe > Germany
  - Hamburg (0.04)
- North America > United States
  - California
    - Los Angeles County > Los Angeles (0.04)
    - Santa Barbara County > Santa Barbara (0.05)
  - New York > New York County
    - New York City (0.15)
  - Texas > Dallas County
    - Dallas (0.04)

Genre:
- Research Report (0.50)

Industry:
- Consumer Products & Services (0.46)
- Health & Medicine (0.47)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks
    - Deep Learning (0.68)
  - Natural Language > Large Language Model (0.68)
  - Representation & Reasoning (1.00)