Causal Inspired Multi Modal Recommendation
Yang, Jie, Gu, Chenyang, Liu, Zixuan
–arXiv.org Artificial Intelligence
Multimodal recommender systems enhance personalized recommendations in e-commerce and online advertising by integrating visual, textual, and user-item interaction data. However, existing methods often overlook two critical biases: (i) modal confounding, where latent factors (e.g., brand style or product category) simultaneously drive multiple modalities and influence user preference, leading to spurious feature-preference associations; (ii) interaction bias, where genuine user preferences are mixed with noise from exposure effects and accidental clicks. To address these challenges, we propose a Causal-inspired multimodal Recommendation framework. Specifically, we introduce a dual-channel cross-modal diffusion module to identify hidden modal confounders, utilize back-door adjustment with hierarchical matching and vector-quantized codebooks to block confounding paths, and apply front-door adjustment combined with causal topology reconstruction to build a deconfounded causal subgraph. Extensive experiments on three real-world e-commerce datasets demonstrate that our method significantly outperforms state-of-the-art baselines while maintaining strong interpretability.
arXiv.org Artificial Intelligence
Oct-15-2025
- Country:
- Asia
- China
- Shandong Province (0.04)
- Shanghai > Shanghai (0.04)
- Singapore (0.04)
- China
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- North America > United States
- New York > New York County > New York City (0.04)
- Asia
- Genre:
- Research Report (1.00)
- Industry:
- Information Technology > Services (0.75)
- Technology: