Transfer Learning with Joint Fine-Tuning for Multimodal Sentiment Analysis

de Toledo, Guilherme Lourenço, Marcacini, Ricardo Marcondes

Oct-11-2022–arXiv.org Artificial Intelligence

Most existing methods focus on sentiment analysis of textual data. However, recently there has been a massive use of images and videos on social platforms, motivating sentiment analysis from other modalities. Current studies show that exploring other modalities (e.g., images) increases sentiment analysis performance. State-of-the-art multimodal models, such as CLIP and VisualBERT, are pre-trained on datasets with the text paired with images. Although the results obtained by these models are promising, pre-training and sentiment analysis fine-tuning tasks of these models are computationally expensive. This paper introduces a transfer learning approach using joint fine-tuning for sentiment analysis. Our proposal achieved competitive results using a more straightforward alternative fine-tuning strategy that leverages different pre-trained unimodal models and efficiently combines them in a multimodal space. Moreover, our proposal allows flexibility when incorporating any pre-trained model for texts and images during the joint fine-tuning stage, being especially interesting for sentiment classification in low-resource scenarios.

artificial intelligence, natural language, sentiment analysis, (14 more...)

arXiv.org Artificial Intelligence

Oct-11-2022

arXiv.org PDF

Add feedback

Country:
- South America > Brazil
  - São Paulo (0.04)
- North America > United States
  - Maryland > Baltimore (0.04)

Genre:
- Research Report > New Finding (0.89)

Technology:
- Information Technology > Artificial Intelligence > Natural Language
  - Information Extraction (1.00)
  - Discourse & Dialogue (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found