Causal Temporal Prediction An Effective and Efficient Multi Modal Approach
–Neural Information Processing Systems
Spatio-temporal prediction plays a crucial role in intelligent transportation, weather forecasting, and urban planning. While integrating multi-modal data has shown potential for enhancing prediction accuracy, key challenges persist: (i) inadequate fusion of multi-modal information, (ii) confounding factors that obscure causal relations, and (iii) high computational complexity of prediction models. To address these challenges, we propose E2-CSTP, an Effective and Efficient Causal multimodal Spatio-Temporal Prediction framework. E2-CSTP leverages cross-modal attention and gating mechanisms to effectively integrate multi-modal data. Building on this, we design a dual-branch causal inference approach: the primary branch focuses on spatio-temporal prediction, while the auxiliary branch mitigates bias by modeling additional modalities and applying causal interventions to uncover true causal dependencies. To improve model efficiency, we integrate GCN with the Mamba architecture for accelerated spatio-temporal encoding. Extensive experiments on 4 real-world datasets show that E2-CSTP significantly outperforms 9 state-of-the-art methods, achieving up to 9.66% improvements in accuracy as well as 17.37%-56.11%
Neural Information Processing Systems
Jun-22-2026, 05:32:20 GMT
- Genre:
- Research Report > Experimental Study (1.00)
- Industry:
- Health & Medicine (0.67)
- Banking & Finance (0.67)
- Transportation > Ground
- Road (0.46)
- Technology:
- Information Technology
- Data Science > Data Mining (1.00)
- Communications (0.68)
- Artificial Intelligence
- Representation & Reasoning (1.00)
- Natural Language (1.00)
- Vision (0.93)
- Machine Learning
- Statistical Learning (1.00)
- Neural Networks > Deep Learning (1.00)
- Information Technology