Leveraging IndoBERT and DistilBERT for Indonesian Emotion Classification in E-Commerce Reviews
Christian, William, Adamlu, Daniel, Yu, Adrian, Suhartono, Derwin
–arXiv.org Artificial Intelligence
Understanding emotions in the Indonesian language is essential for improving customer experiences in e-commerce. This study focuses on enhancing the accuracy of emotion classification in Indonesian by leveraging advanced language models, IndoBERT and DistilBERT. A key component of our approach was data processing, specifically data augmentation, which included techniques such as back-translation and synonym replacement. These methods played a significant role in boosting the model's performance. After hyperparameter tuning, IndoBERT achieved an accuracy of 80\%, demonstrating the impact of careful data processing. While combining multiple IndoBERT models led to a slight improvement, it did not significantly enhance performance. Our findings indicate that IndoBERT was the most effective model for emotion classification in Indonesian, with data augmentation proving to be a vital factor in achieving high accuracy. Future research should focus on exploring alternative architectures and strategies to improve generalization for Indonesian NLP tasks.
arXiv.org Artificial Intelligence
Sep-19-2025
- Country:
- Asia > Indonesia
- Borneo > Kalimantan
- East Kalimantan > Nusantara (0.04)
- Java > Jakarta
- Jakarta (0.04)
- Borneo > Kalimantan
- Asia > Indonesia
- Genre:
- Research Report > New Finding (0.67)
- Industry:
- Information Technology > Services > e-Commerce Services (0.61)
- Technology: