Analyzing Emotions in Bangla Social Media Comments Using Machine Learning and LIME
Paul, Bidyarthi, Rahman, SM Musfiqur, Biswas, Dipta, Hasan, Md. Ziaul, Hossain, Md. Zahid
–arXiv.org Artificial Intelligence
Research on understanding emotions in written language continues to expand, especially for understudied languages with distinctive regional expressions and cultural features, such as Bangla. This study examines emotion analysis using 22,698 social media comments from the EmoNoBa dataset. For language analysis, we employ machine learning models--Linear SVM, KNN, and Random Forest--with n-gram data from a TF-IDF vectorizer. We additionally investigated how PCA affects the reduction of dimensionality. Moreover, we utilized a BiLSTM model and AdaBoost to improve decision trees. To make our machine learning models easier to understand, we used LIME to explain the predictions of the AdaBoost classifier, which uses decision trees. With the goal of advancing sentiment analysis in languages with limited resources, our work examines various techniques to find efficient techniques for emotion identification in Bangla.
arXiv.org Artificial Intelligence
Jun-13-2025
- Country:
- Asia
- Bangladesh > Dhaka Division
- Dhaka District > Dhaka (0.04)
- Middle East > Israel (0.04)
- Bangladesh > Dhaka Division
- Europe > Denmark
- Capital Region > Copenhagen (0.04)
- North America
- Canada > Alberta
- Census Division No. 5
- Kneehill County (0.04)
- Starland County (0.04)
- Census Division No. 7 > Stettler County No. 6 (0.04)
- Census Division No. 8 > Red Deer County (0.04)
- Census Division No. 5
- United States > Massachusetts
- Suffolk County > Boston (0.04)
- Canada > Alberta
- Asia
- Genre:
- Research Report > Experimental Study (0.34)
- Industry:
- Health & Medicine > Therapeutic Area (0.68)
- Technology: