Multimodal and Multiview Deep Fusion for Autonomous Marine Navigation

Dagdilelis, Dimitrios, Grigoriadis, Panagiotis, Galeazzi, Roberto

May-6-2025–arXiv.org Artificial Intelligence

We propose a cross attention transformer based method for multimodal sensor fusion to build a birds eye view of a vessels surroundings supporting safer autonomous marine navigation. The model deeply fuses multiview RGB and long wave infrared images with sparse LiDAR point clouds. Training also integrates X band radar and electronic chart data to inform predictions. The resulting view provides a detailed reliable scene representation improving navigational accuracy and robustness. Real world sea trials confirm the methods effectiveness even in adverse weather and complex maritime settings.

detection, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

May-6-2025

arXiv.org PDF

Add feedback

Genre:
- Research Report (1.00)

Industry:
- Transportation (0.94)

Technology:
- Information Technology
  - Sensing and Signal Processing > Image Processing (0.93)
  - Artificial Intelligence
    - Vision (1.00)
    - Robots (1.00)
    - Representation & Reasoning > Information Fusion (1.00)
    - Natural Language (0.88)
    - Machine Learning > Neural Networks
      - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found