M2SA: Multimodal and Multilingual Model for Sentiment Analysis of Tweets
Thakkar, Gaurish, Hakimov, Sherzod, Tadić, Marko
–arXiv.org Artificial Intelligence
In recent years, multimodal natural language processing, aimed at learning from diverse data types, has garnered significant attention. However, there needs to be more clarity when it comes to analysing multimodal tasks in multi-lingual contexts. While prior studies on sentiment analysis of tweets have predominantly focused on the English language, this paper addresses this gap by transforming an existing textual Twitter sentiment dataset into a multimodal format through a straightforward curation process. Our work opens up new avenues for sentiment-related research within the research community. Additionally, we conduct baseline experiments utilising this augmented dataset and report the findings. Notably, our evaluations reveal that when comparing unimodal and multimodal configurations, using a sentiment-tuned large language model as a text encoder performs exceptionally well.
arXiv.org Artificial Intelligence
Jun-12-2024
- Country:
- South America > Colombia (0.04)
- North America
- United States
- Minnesota > Hennepin County
- Minneapolis (0.04)
- Michigan > Washtenaw County
- Ann Arbor (0.04)
- Georgia > Fulton County
- Atlanta (0.05)
- Colorado > Denver County
- Denver (0.04)
- California > Orange County
- Laguna Hills (0.04)
- Minnesota > Hennepin County
- Canada > Quebec
- Montreal (0.04)
- United States
- Europe
- Middle East > Malta (0.04)
- Spain (0.04)
- Croatia > Zagreb County
- Zagreb (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- Iceland > Capital Region
- Reykjavik (0.04)
- Germany > Brandenburg
- Potsdam (0.04)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Sweden > Östergötland County
- Linköping (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Estonia > Tartu County
- Tartu (0.04)
- Asia > Taiwan
- Taiwan Province > Taipei (0.04)
- Africa > Central African Republic
- Ombella-M'Poko > Bimbo (0.04)
- Genre:
- Research Report (0.50)
- Industry:
- Health & Medicine > Therapeutic Area (0.68)
- Information Technology > Services (0.46)
- Technology: