Structurally Refined Graph Transformer for Multimodal Recommendation
Shi, Ke, Zhang, Yan, Zhang, Miao, Chen, Lifan, Yi, Jiali, Xiao, Kui, Hou, Xiaoju, Li, Zhifei
arXiv.org Artificial Intelligence
Abstract: Multimodal recommendation systems utilize various types of information, including images and text, to enhance the effectiveness of recommendations. The key challenge is predicting user purchasing behavior from the available data. Existing approaches rely heavily on a single semantic framework (e.g., local or global semantics), resulting in an incomplete or biased representation of user preferences, particularly those less expressed in prior interactions. Furthermore, these approaches fail to capture the complex interactions between users and items, limiting the model's ability to serve diverse users. To address these challenges, we present SRGFormer, a structurally optimized multimodal recommendation model. By modifying the transformer for better integration into our model, we capture the overall behavior patterns of users. Then, we enhance structural information by embedding multimodal information into a hypergraph structure to aid in learning the local structures between users and items. Meanwhile, applying self-supervised tasks to user-item collaborative signals enhances the integration of multimodal information, thereby revealing the representational features inherent to each modality of the data. Extensive experiments on three public datasets reveal that SRGFormer surpasses previous benchmark models, achieving an average performance improvement of 4.47% on the Sports dataset.
The swift growth of online data has led platforms to implement multimodal recommendation systems, initially using collaborative filtering (CF) to analyze user preferences from historical interactions [1], [2]. However, CF struggles to handle sparse or non-existent interaction records, leading to less accurate predictions.
Nov-4-2025
- Country:
- Asia > China
- Guangdong Province > Guangzhou (0.04)
- Hubei Province > Wuhan (0.05)
- Shandong Province > Jinan (0.04)
- Europe > Greece
- Genre:
- Research Report (1.00)
- Industry:
- Information Technology (0.93)