Evaluating and explaining training strategies for zero-shot cross-lingual news sentiment analysis
Andrenšek, Luka; Koloski, Boshko; Pelicon, Andraž; Lavrač, Nada; Pollak, Senja; Purver, Matthew
arXiv.org Artificial Intelligence
We investigate zero-shot cross-lingual news sentiment detection, aiming to develop robust sentiment classifiers that can be deployed across multiple languages without target-language training data. We introduce novel evaluation datasets in several less-resourced languages, and experiment with a range of approaches including the use of machine translation; in-context learning with large language models; and various intermediate training regimes including a novel task objective, POA, that leverages paragraph-level information. Our results demonstrate significant improvements over the state of the art, with in-context learning generally giving the best performance, but with the novel POA approach giving a competitive alternative with much lower computational overhead. We also show that language similarity is not in itself sufficient for predicting the success of cross-lingual transfer, but that similarity in semantic content and structure can be equally important.
Sep-30-2024
- Country:
- South America > Colombia
- Meta Department > Villavicencio (0.04)
- North America > United States
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Massachusetts > Suffolk County
- Boston (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- Europe
- Monaco (0.04)
- Kosovo (0.04)
- Estonia (0.04)
- Croatia (0.04)
- United Kingdom > England
- Oxfordshire > Oxford (0.04)
- Greater London > London (0.04)
- Slovenia > Central Slovenia
- Municipality of Ljubljana > Ljubljana (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Asia
- Genre:
- Research Report
- New Finding (1.00)
- Experimental Study (0.93)
- Technology: