Categorical Classification of Book Summaries Using Word Embedding Techniques
Keskin, Kerem, Keleş, Mümine Kaya
–arXiv.org Artificial Intelligence
In this study, book summaries and categories taken from book sites were classified using word embedding methods, natural language processing techniques and machine learning algorithms. In addition, one hot encoding, Word2Vec and Term Frequency - Inverse Document Frequency (TF - IDF) methods, which are frequently used word embedding methods were used in this study and their success was compared. Additionally, the combination table of the pre - processing methods used is shown and added to the table. Looking at the results, it was observed that Support Vector Machine, Naive Bayes and Logistic Regression Models and TF - IDF and One - Hot Encoder word embedding techniques gave more successful results for Turkish texts. Using word2vec to process big text data.
arXiv.org Artificial Intelligence
Jul-30-2025
- Country:
- Asia
- Afghanistan > Kabul Province
- Kabul (0.04)
- Indonesia (0.04)
- Middle East > Republic of Türkiye
- Adana Province > Adana (0.04)
- Afghanistan > Kabul Province
- Europe
- Germany > Berlin (0.04)
- Kosovo > District of Pristina
- Pristina (0.06)
- North America > United States
- Arizona > Maricopa County > Scottsdale (0.04)
- Asia
- Genre:
- Research Report > New Finding (0.74)
- Technology: