Bridging Domain Gaps between Pretrained Multimodal Models and Recommendations

Zhang, Wenyu, Luo, Jie, Zhang, Xinming, Fang, Yuan

Feb-21-2025–arXiv.org Artificial Intelligence

With the explosive growth of multimodal content online, pre-trained visual-language models have shown great potential for multimodal recommendation. However, while these models achieve decent performance when applied in a frozen manner, surprisingly, due to significant domain gaps (e.g., feature distribution discrepancy and task objective misalignment) between pre-training and personalized recommendation, adopting a joint training approach instead leads to performance worse than baseline. Existing approaches either rely on simple feature extraction or require computationally expensive full model fine-tuning, struggling to balance effectiveness and efficiency. To tackle these challenges, we propose \textbf{P}arameter-efficient \textbf{T}uning for \textbf{M}ultimodal \textbf{Rec}ommendation (\textbf{PTMRec}), a novel framework that bridges the domain gap between pre-trained models and recommendation systems through a knowledge-guided dual-stage parameter-efficient training strategy. This framework not only eliminates the need for costly additional pre-training but also flexibly accommodates various parameter-efficient tuning methods.

pre-trained model, recommendation, recommendation model, (14 more...)

arXiv.org Artificial Intelligence

Feb-21-2025

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - California > San Diego County > San Diego (0.04)
- Europe
  - Austria > Vienna (0.14)
  - Netherlands (0.04)
  - United Kingdom > England
    - Greater London > London (0.04)
  - Romania > Sud - Muntenia Development Region
    - Giurgiu County > Giurgiu (0.04)
- Asia
  - Singapore (0.04)
  - China > Anhui Province
    - Hefei (0.05)

Genre:
- Research Report (1.00)

Technology:
- Information Technology
  - Data Science > Data Mining (0.89)
  - Artificial Intelligence
    - Representation & Reasoning > Personal Assistant Systems (1.00)
    - Natural Language (1.00)
    - Machine Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found