Sentiment Analysis in Twitter for Macedonian
Jovanoski, Dame, Pachovski, Veno, Nakov, Preslav
–arXiv.org Artificial Intelligence
We present work on sentiment analysis in Twitter for Macedonian. As this is pioneering work for this combination of language and genre, we created suitable resources for training and evaluating a system for sentiment analysis of Macedonian tweets. In particular, we developed a corpus of tweets annotated with tweet-level sentiment polarity (positive, negative, and neutral), as well as with phrase-level sentiment, which we made freely available for research purposes. We further bootstrapped several large-scale sentiment lexicons for Macedonian, motivated by previous work for English. The impact of several different pre-processing steps as well as of various features is shown in experiments that represent the first attempt to build a system for sentiment analysis in Twitter for the morphologically rich Macedonian language. Overall, our experimental results show an F1-score of 92.16, which is very strong and is on par with the best results for English, which were achieved in recent SemEval competitions.
arXiv.org Artificial Intelligence
Sep-27-2021
- Country:
- North America
- United States
- District of Columbia > Washington (0.04)
- Washington > King County
- Seattle (0.04)
- Pennsylvania > Philadelphia County
- Philadelphia (0.04)
- New Jersey > Bergen County
- Mahwah (0.04)
- Michigan > Washtenaw County
- Ann Arbor (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- Colorado > Denver County
- Denver (0.05)
- California > Los Angeles County
- Los Angeles (0.14)
- Canada > British Columbia
- United States
- Europe
- Bulgaria (0.04)
- Spain (0.04)
- United Kingdom > England
- Greater Manchester > Manchester (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- North Macedonia > Skopje Statistical Region
- Skopje Municipality > Skopje (0.04)
- Middle East > Malta
- Port Region > Southern Harbour District > Valletta (0.04)
- Italy > Liguria
- Genoa (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.05)
- Asia
- North America
- Genre:
- Research Report > New Finding (0.49)
- Industry:
- Information Technology > Services (0.46)
- Technology: