A Survey on Interpretable Cross-modal Reasoning
Xue, Dizhan, Qian, Shengsheng, Zhou, Zuyi, Xu, Changsheng
arXiv.org Artificial Intelligence
In recent years, cross-modal reasoning (CMR), the process of understanding and reasoning across different modalities, has emerged as a pivotal area with applications ranging from multimedia analysis to healthcare diagnostics. As AI systems are deployed more widely, the demand for transparency and comprehensibility in their decision-making processes has intensified. This survey examines interpretable cross-modal reasoning (I-CMR), where the objective is not only to achieve high predictive performance but also to provide human-understandable explanations for the results. It presents a comprehensive overview of typical I-CMR methods organized under a three-level taxonomy, reviews the existing CMR datasets annotated with explanations, and summarizes the challenges for I-CMR along with potential future directions. The survey aims to catalyze progress in this emerging research area by giving researchers a comprehensive view of the state of the art and of the open opportunities. The summarized methods, datasets, and other resources are available at https://github.com/ZuyiZhou/Awesome-Interpretable-Cross-modal-Reasoning.
Sep-14-2023
- Country:
- North America > Canada
- Asia > China
- Beijing > Beijing (0.04)
- Guangdong Province > Shenzhen (0.04)
- Genre:
- Overview (1.00)
- Industry:
- Health & Medicine (0.87)
- Technology:
- Information Technology
- Sensing and Signal Processing > Image Processing (1.00)
- Artificial Intelligence
- Vision (1.00)
- Representation & Reasoning > Expert Systems (1.00)
- Cognitive Science > Problem Solving (1.00)
- Natural Language
- Text Processing (1.00)
- Explanation & Argumentation (1.00)
- Machine Learning > Neural Networks
- Deep Learning (0.93)