A Simple Approach to Multilingual Polarity Classification in Twitter
Tellez, Eric S., Jiménez, Sabino Miranda, Graff, Mario, Moctezuma, Daniela, Suárez, Ranyart R., Siordia, Oscar S.
Sentiment analysis has recently received considerable attention because of the interest in mining the opinions of social media users. It consists of determining the polarity of a given text, i.e., its degree of positiveness or negativeness. Traditionally, sentiment analysis algorithms have been tailored to a specific language, given the complexity introduced by the many lexical variations and errors in user-generated content. In this contribution, our aim is to provide a simple-to-implement and easy-to-use multilingual framework that can serve as a baseline for sentiment analysis contests and as a starting point for building new sentiment analysis systems. We evaluate our approach on eight different languages, three of which have important international contests, namely SemEval (English), TASS (Spanish), and SENTIPOLC (Italian). Within these competitions, our approach reaches medium to high positions in the rankings, whereas in the remaining languages it outperforms the reported results.
Dec-15-2016
- Country:
  - South America > Chile
  - North America
    - Mexico (0.05)
    - Canada (0.04)
    - United States
      - New York > New York County > New York City (0.04)
      - Colorado > Denver County > Denver (0.04)
      - California > San Diego County > San Diego (0.04)
  - Europe
    - United Kingdom > England > Cambridgeshire > Cambridge (0.04)
    - Italy > Tuscany > Pisa Province > Pisa (0.04)
    - Middle East
  - Asia > Middle East > Republic of Türkiye > Istanbul Province > Istanbul (0.04)
- Genre:
  - Research Report (0.82)
- Technology: