Expressive and Scalable Quantum Fusion for Multimodal Learning
Tuyen Nguyen, Trong Nghia Hoang, Phi Le Nguyen, Hai L. Vu, Truong Cong Thang
arXiv.org Artificial Intelligence
The aim of this paper is to introduce a quantum fusion mechanism for multimodal learning and to establish its theoretical and empirical potential. The proposed method, called the Quantum Fusion Layer (QFL), replaces classical fusion schemes with a hybrid quantum-classical procedure that uses parameterized quantum circuits to learn entangled feature interactions without requiring exponential parameter growth. Supported by quantum signal processing principles, the quantum component efficiently represents high-order polynomial interactions across modalities with linear parameter scaling, and we provide a separation example between QFL and low-rank tensor-based methods that highlights potential quantum query advantages. In simulation, QFL consistently outperforms strong classical baselines on small but diverse multimodal tasks, with particularly marked improvements in high-modality regimes. These results suggest that QFL offers a fundamentally new and scalable approach to multimodal fusion that merits deeper exploration on larger systems.
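To make the idea of entangling feature interactions across modalities concrete, here is a minimal toy sketch, not the paper's actual QFL: each modality contributes one scalar feature encoded as a single-qubit RY rotation angle, a trainable RY layer follows, a CNOT entangles the two qubits, and the per-qubit Pauli-Z expectations form the fused output. The two-qubit state is simulated directly with NumPy; a real implementation would use a quantum SDK and learn the rotation parameters by gradient descent.

```python
import numpy as np

def ry(theta):
    # Single-qubit RY rotation matrix.
    c, s = np.cos(theta / 2.0), np.sin(theta / 2.0)
    return np.array([[c, -s], [s, c]])

# CNOT with qubit 0 as control, qubit 1 as target.
CNOT = np.array([[1, 0, 0, 0],
                 [0, 1, 0, 0],
                 [0, 0, 0, 1],
                 [0, 0, 1, 0]], dtype=float)
Z, I2 = np.diag([1.0, -1.0]), np.eye(2)

def quantum_fusion(x1, x2, theta1, theta2):
    """Toy fusion of two scalar modality features (hypothetical example).

    Encodes x1, x2 as RY angles, applies trainable RY rotations
    (theta1, theta2), entangles with a CNOT, and returns the two
    Pauli-Z expectations as the fused feature vector.
    """
    state = np.array([1.0, 0.0, 0.0, 0.0])            # start in |00>
    state = np.kron(ry(x1), ry(x2)) @ state           # data encoding
    state = np.kron(ry(theta1), ry(theta2)) @ state   # trainable layer
    state = CNOT @ state                              # entangle modalities
    return np.array([state @ np.kron(Z, I2) @ state,  # <Z> on qubit 0
                     state @ np.kron(I2, Z) @ state]) # <Z> on qubit 1
```

For example, `quantum_fusion(0.0, 0.0, 0.0, 0.0)` leaves the state at `|00>` and returns `[1.0, 1.0]`. Because the entangling gate couples the qubits, the readout on each qubit depends jointly on both modality inputs, which is the kind of high-order cross-modal interaction the paper's circuits are designed to represent with only linearly many parameters.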
Oct-9-2025