A Corpus for Sentence-level Subjectivity Detection on English News Articles
Antici, Francesco, Galassi, Andrea, Ruggeri, Federico, Korre, Katerina, Muti, Arianna, Bardi, Alessandra, Fedotova, Alice, Barrón-Cedeño, Alberto
–arXiv.org Artificial Intelligence
We present a novel corpus for subjectivity detection at the sentence level. We develop new annotation guidelines for the task, which are not limited to language-specific cues, and apply them to produce a new corpus in English. The corpus consists of 411 subjective and 638 objective sentences extracted from ongoing coverage of political affairs from online news outlets. This new resource paves the way for the development of models for subjectivity detection in English and across other languages, without relying on language-specific tools like lexicons or machine translation. We evaluate state-of-the-art multilingual transformer-based models on the task, both in mono- and cross-lingual settings, the latter with a similar existing corpus in Italian language. We observe that enriching our corpus with resources in other languages improves the results on the task.
arXiv.org Artificial Intelligence
May-29-2023
- Country:
- South America > Brazil (0.04)
- Oceania > Australia
- North America
- Dominican Republic (0.04)
- United States
- New York (0.04)
- Washington > King County
- Seattle (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Maryland > Prince George's County
- College Park (0.04)
- Canada > British Columbia
- Europe
- Russia (0.14)
- Ukraine (0.04)
- Portugal (0.04)
- Germany > Bavaria
- Upper Bavaria > Munich (0.04)
- Bulgaria > Varna Province
- Varna (0.04)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Spain
- Valencian Community > Alicante Province
- Alicante (0.04)
- Catalonia > Barcelona Province
- Barcelona (0.04)
- Valencian Community > Alicante Province
- Netherlands
- South Holland > Dordrecht (0.04)
- North Holland > Amsterdam (0.04)
- Italy
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Asia
- Russia (0.46)
- Singapore (0.04)
- South Korea (0.04)
- India (0.04)
- Thailand > Chiang Mai
- Chiang Mai (0.04)
- Japan > Kyūshū & Okinawa
- Kyūshū > Miyazaki Prefecture > Miyazaki (0.04)
- China
- Genre:
- Research Report (0.50)
- Industry:
- Media > News (1.00)
- Health & Medicine > Therapeutic Area
- Infections and Infectious Diseases (0.68)
- Immunology (0.46)
- Technology: