Developing Corpora for Sentiment Analysis: The Case of Irony and Senti-TUT (Extended Abstract)
Bosco, Cristina (Dipartimento di Informatica, Università di Torino) | Patti, Viviana (Dipartimento di Informatica, Università di Torino) | Bolioli, Andrea (CELI srl)
This paper focusses on the main issues in developing a corpus for opinion and sentiment analysis, with special attention to irony, and presents as a case study Senti-TUT, a project for Italian aimed at investigating sentiment and irony in social media. We present the Senti-TUT corpus, a collection of texts from Twitter annotated with sentiment polarity. We describe the dataset, the annotation, the methodologies applied, and our investigations of two important features of irony: polarity reversing and emotion expressions.
- Country:
  - North America > United States
    - New York (0.04)
    - Oregon > Multnomah County
      - Portland (0.04)
  - Europe
    - Spain > Catalonia
      - Barcelona Province > Barcelona (0.04)
    - Middle East > Republic of Türkiye
      - Istanbul Province > Istanbul (0.05)
    - Italy
      - Tuscany > Pisa Province
        - Pisa (0.04)
      - Piedmont > Turin Province
        - Turin (0.04)
    - Iceland > Capital Region
      - Reykjavik (0.04)
  - Asia > Middle East
    - Republic of Türkiye > Istanbul Province > Istanbul (0.05)
- Industry:
  - Government (0.46)
- Technology: