Developing a Comprehensive Framework for Sentiment Analysis in Turkish
–arXiv.org Artificial Intelligence
In this thesis, we developed a comprehensive framework for sentiment analysis that takes its many aspects into account mainly for Turkish. We have also proposed several approaches specific to sentiment analysis in English only. We have accordingly made five major and three minor contributions. We generated a novel and effective feature set by combining unsupervised, semi-supervised, and supervised metrics. We then fed them as input into classical machine learning methods, and outperformed neural network models for datasets of different genres in both Turkish and English. We created a polarity lexicon with a semi-supervised domain-specific method, which has been the first approach applied for corpora in Turkish. We performed a fine morphological analysis for the sentiment classification task in Turkish by determining the polarities of morphemes. This can be adapted to other morphologically-rich or agglutinative languages as well. We have built a novel neural network architecture, which combines recurrent and recursive neural network models for English. We built novel word embeddings that exploit sentiment, syntactic, semantic, and lexical characteristics for both Turkish and English. We also redefined context windows as subclauses in modelling word representations in English. This can also be applied to other linguistic fields and natural language processing tasks. We have achieved state-of-the-art and significant results for all these original approaches. Our minor contributions include methods related to aspect-based sentiment in Turkish, parameter redefinition in the semi-supervised approach, and aspect term extraction techniques for English. This thesis can be considered the most detailed and comprehensive study made on sentiment analysis in Turkish as of July, 2020. Our work has also contributed to the opinion classification problem in English.
arXiv.org Artificial Intelligence
Dec-2-2025
- Country:
- Asia
- China > Beijing
- Beijing (0.04)
- India (0.04)
- Middle East
- Jordan (0.04)
- Qatar > Ad-Dawhah
- Doha (0.04)
- Republic of Türkiye
- Ankara Province > Ankara (0.04)
- Mersin Province > Mersin (0.04)
- Russia (0.04)
- South Korea (0.04)
- Vietnam (0.04)
- China > Beijing
- Europe
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Sweden > Vaestra Goetaland
- Gothenburg (0.04)
- Middle East
- Cyprus > Limassol
- Limassol (0.04)
- Malta > Port Region
- Southern Harbour District > Valletta (0.04)
- Cyprus > Limassol
- Russia (0.04)
- France (0.04)
- Slovenia (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- Italy > Liguria
- Genoa (0.04)
- Ukraine > Crimea (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Spain
- Galicia > Madrid (0.04)
- Region of Murcia > Murcia (0.04)
- Germany > Berlin (0.04)
- Switzerland > Zürich
- Zürich (0.13)
- Belgium > Brussels-Capital Region
- North America
- Canada
- United States
- New York > New York County
- New York City (0.04)
- Alaska > Anchorage Municipality
- Anchorage (0.04)
- Washington > King County
- Seattle (0.04)
- New Mexico > Santa Fe County
- Santa Fe (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Oregon > Multnomah County
- Portland (0.04)
- Pennsylvania > Philadelphia County
- Philadelphia (0.04)
- Maryland
- Baltimore (0.04)
- Prince George's County > College Park (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Colorado > Denver County
- Denver (0.04)
- California > San Diego County
- San Diego (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.13)
- Michigan > Washtenaw County
- Ann Arbor (0.04)
- Texas > Travis County
- Austin (0.04)
- New York > New York County
- Oceania > New Zealand
- North Island > Auckland Region > Auckland (0.04)
- Asia
- Genre:
- Overview (1.00)
- Research Report
- Experimental Study (1.00)
- New Finding (1.00)
- Promising Solution (0.87)
- Industry:
- Information Technology > Services (0.67)
- Leisure & Entertainment (0.93)
- Media > Film (0.93)
- Technology: