Comparison of Topic Modelling Approaches in the Banking Context
Ogunleye, Bayode, Maswera, Tonderai, Hirsch, Laurence, Gaudoin, Jotham, Brunsdon, Teresa
–arXiv.org Artificial Intelligence
Topic modelling is a prominent task for automatic topic extraction in many applications such as sentiment analysis and recommendation systems. The approach is vital for service industries to monitor their customer discussions. The use of traditional approaches such as Latent Dirichlet Allocation (LDA) for topic discovery has shown great performances, however, they are not consistent in their results as these approaches suffer from data sparseness and inability to model the word order in a document. Thus, this study presents the use of Kernel Principal Component Analysis (KernelPCA) and K-means Clustering in the BERTopic architecture. We have prepared a new dataset using tweets from customers of Nigerian banks and we use this to compare the topic modelling approaches. Our findings showed KernelPCA and K-means in the BERTopic architecture-produced coherent topics with a coherence score of 0.8463.
arXiv.org Artificial Intelligence
Feb-5-2024
- Country:
- Africa
- Equatorial Guinea (0.04)
- Ghana (0.04)
- Lesotho (0.04)
- Middle East > Morocco (0.04)
- Nigeria > Federal Capital Territory
- Abuja (0.04)
- Sierra Leone (0.04)
- Asia
- China
- India
- Maharashtra > Mumbai (0.04)
- Telangana > Hyderabad (0.04)
- Japan
- Honshū > Chūgoku
- Okayama Prefecture > Okayama (0.04)
- Kyūshū & Okinawa > Kyūshū
- Fukuoka Prefecture > Fukuoka (0.04)
- Honshū > Chūgoku
- Middle East
- Jordan (0.04)
- Palestine > Gaza Strip
- Gaza Governorate > Gaza (0.04)
- Singapore (0.04)
- Thailand > Bangkok
- Bangkok (0.04)
- Europe
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Estonia > Harju County
- Tallinn (0.04)
- France > Auvergne-Rhône-Alpes
- Germany > Baden-Württemberg
- Karlsruhe Region > Heidelberg (0.04)
- Italy > Tuscany
- Florence (0.04)
- Middle East > Malta
- Port Region > Southern Harbour District > Valletta (0.04)
- Switzerland > Basel-City
- Basel (0.04)
- United Kingdom > England
- South Yorkshire > Sheffield (0.04)
- Belgium > Brussels-Capital Region
- North America
- Canada > British Columbia
- United States
- California
- Alameda County > Berkeley (0.04)
- Los Angeles County > Los Angeles (0.14)
- Florida > Broward County
- Hollywood (0.04)
- Maryland > Baltimore (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- New York > New York County
- New York City (0.04)
- Texas > Brazos County
- College Station (0.04)
- California
- South America > Brazil
- Rio de Janeiro > Rio de Janeiro (0.04)
- Africa
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Banking & Finance (1.00)
- Consumer Products & Services > Travel (0.46)
- Education > Educational Setting
- Online (0.68)
- Government (0.68)
- Information Technology > Services (0.67)
- Technology: