The Model Arena for Cross-lingual Sentiment Analysis: A Comparative Study in the Era of Large Language Models

Zhu, Xiliang, Gardiner, Shayna, Roldán, Tere, Rossouw, David

Jun-27-2024–arXiv.org Artificial Intelligence

Sentiment analysis serves as a pivotal component in Natural Language Processing (NLP). Advancements in multilingual pre-trained models such as XLM-R and mT5 have contributed to the increasing interest in cross-lingual sentiment analysis. The recent emergence in Large Language Models (LLM) has significantly advanced general NLP tasks, however, the capability of such LLMs in cross-lingual sentiment analysis has not been fully studied. This work undertakes an empirical analysis to compare the cross-lingual transfer capability of public Small Multilingual Language Models (SMLM) like XLM-R, against English-centric LLMs such as Llama-3, in the context of sentiment analysis across English, Spanish, French and Chinese. Our findings reveal that among public models, SMLMs exhibit superior zero-shot cross-lingual performance relative to LLMs. However, in few-shot cross-lingual settings, public LLMs demonstrate an enhanced adaptive potential. In addition, we observe that proprietary GPT-3.5 and GPT-4 lead in zero-shot cross-lingual capability, but are outpaced by public models in few-shot scenarios.

computational linguistic, sentiment analysis, target language, (13 more...)

arXiv.org Artificial Intelligence

Jun-27-2024

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Oregon > Multnomah County
    - Portland (0.04)
  - Minnesota > Hennepin County
    - Minneapolis (0.14)
- Europe
  - Monaco (0.04)
  - France > Provence-Alpes-Côte d'Azur
    - Bouches-du-Rhône > Marseille (0.04)
- Asia
  - South Korea (0.04)
  - Singapore (0.04)
  - Indonesia > Bali (0.04)
  - Middle East
    - Jordan (0.04)
    - UAE > Abu Dhabi Emirate
      - Abu Dhabi (0.14)

Genre:
- Research Report > New Finding (0.88)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.94)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found