Provable Dynamic Fusion for Low-Quality Multimodal Data

Zhang, Qingyang, Wu, Haitao, Zhang, Changqing, Hu, Qinghua, Fu, Huazhu, Zhou, Joey Tianyi, Peng, Xi

Jun-6-2023–arXiv.org Artificial Intelligence

The inherent challenge of multimodal fusion is to precisely capture cross-modal correlations and to flexibly conduct cross-modal interaction. To fully exploit the value of each modality and mitigate the influence of low-quality multimodal data, dynamic multimodal fusion has emerged as a promising learning paradigm. Despite its widespread use, theoretical justification in this field is still notably lacking. Can we design a provably robust multimodal fusion method? This paper provides a theoretical understanding that answers this question, from the generalization perspective, under one of the most popular multimodal fusion frameworks. We then show that several uncertainty estimation solutions are naturally available to achieve robust multimodal fusion. Finally, we propose a novel multimodal fusion framework termed Quality-aware Multimodal Fusion (QMF), which can improve both classification accuracy and model robustness. Extensive experimental results on multiple benchmarks support our findings.
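To make the idea of dynamic fusion concrete, the following is a minimal sketch of uncertainty-weighted late fusion. It is not the QMF method itself: the abstract does not specify the uncertainty estimator or the fusion rule, so this sketch assumes predictive entropy as a stand-in confidence score, and the names `softmax`, `confidence`, and `dynamic_fuse` are hypothetical.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def confidence(probs):
    """Assumed confidence score: 1 - normalized entropy.

    Close to 1.0 for a peaked prediction, 0.0 for a uniform one.
    Requires at least two classes.
    """
    entropy = -sum(p * math.log(p) for p in probs if p > 0.0)
    # Clamp to guard against tiny negative values from rounding.
    return max(0.0, 1.0 - entropy / math.log(len(probs)))


def dynamic_fuse(modality_logits):
    """Fuse per-modality predictions with per-sample confidence weights.

    modality_logits: one list of class logits per modality, all for the
    same sample. Returns (fused_probs, weights).
    """
    probs = [softmax(logits) for logits in modality_logits]
    conf = [confidence(p) for p in probs]
    total = sum(conf)
    if total <= 1e-12:
        # Every modality is maximally uncertain: fall back to equal weights.
        weights = [1.0 / len(probs)] * len(probs)
    else:
        weights = [c / total for c in conf]
    num_classes = len(probs[0])
    fused = [
        sum(w * p[j] for w, p in zip(weights, probs))
        for j in range(num_classes)
    ]
    return fused, weights


# Illustrative inputs (made up): a confident image branch and a noisy text branch.
image_logits = [4.0, 0.0, 0.0]
text_logits = [0.1, 0.2, 0.0]
fused, weights = dynamic_fuse([image_logits, text_logits])
print(weights, fused)
```

In this sketch the weights are recomputed for every sample, so a modality that is corrupted or uninformative for one input contributes less to that input's prediction, while a static fusion rule would apply the same weights everywhere.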

artificial intelligence, machine learning, natural language


Country:
- Oceania > New Zealand (0.04)
- North America > United States
  - New York (0.04)
  - California (0.04)
  - Hawaii > Honolulu County
    - Honolulu (0.04)
- Asia
  - Singapore (0.04)
  - China
    - Tianjin Province > Tianjin (0.05)
    - Sichuan Province > Chengdu (0.04)
    - Hong Kong (0.04)

Genre:
- Research Report > New Finding (0.66)

Industry:
- Health & Medicine > Therapeutic Area > Neurology (0.93)

Technology:
- Information Technology
  - Data Science (1.00)
  - Sensing and Signal Processing > Image Processing (0.93)
  - Artificial Intelligence
    - Vision (1.00)
    - Natural Language (1.00)
    - Representation & Reasoning
      - Uncertainty (0.68)
      - Information Fusion (0.50)
    - Machine Learning
      - Neural Networks (0.69)
      - Performance Analysis > Accuracy (0.34)
