Comparison of Topic Modelling Approaches in the Banking Context
Ogunleye, Bayode, Maswera, Tonderai, Hirsch, Laurence, Gaudoin, Jotham, Brunsdon, Teresa
–arXiv.org Artificial Intelligence
Topic modelling is a prominent task for automatic topic extraction in many applications such as sentiment analysis and recommendation systems. The approach is vital for service industries to monitor their customer discussions. The use of traditional approaches such as Latent Dirichlet Allocation (LDA) for topic discovery has shown great performances, however, they are not consistent in their results as these approaches suffer from data sparseness and inability to model the word order in a document. Thus, this study presents the use of Kernel Principal Component Analysis (KernelPCA) and K-means Clustering in the BERTopic architecture. We have prepared a new dataset using tweets from customers of Nigerian banks and we use this to compare the topic modelling approaches. Our findings showed KernelPCA and K-means in the BERTopic architecture-produced coherent topics with a coherence score of 0.8463.
arXiv.org Artificial Intelligence
Feb-5-2024
- Country:
- South America > Brazil
- Rio de Janeiro > Rio de Janeiro (0.04)
- North America
- United States
- Maryland > Baltimore (0.04)
- Texas > Brazos County
- College Station (0.04)
- New York > New York County
- New York City (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Florida > Broward County
- Hollywood (0.04)
- California
- Los Angeles County > Los Angeles (0.14)
- Alameda County > Berkeley (0.04)
- Canada > British Columbia
- United States
- Europe
- United Kingdom > England
- South Yorkshire > Sheffield (0.04)
- Switzerland > Basel-City
- Basel (0.04)
- Middle East > Malta
- Port Region > Southern Harbour District > Valletta (0.04)
- Italy > Tuscany
- Florence (0.04)
- Germany > Baden-Württemberg
- Karlsruhe Region > Heidelberg (0.04)
- France > Auvergne-Rhône-Alpes
- Estonia > Harju County
- Tallinn (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- United Kingdom > England
- Asia
- Singapore (0.04)
- Thailand > Bangkok
- Bangkok (0.04)
- Middle East
- Jordan (0.04)
- Palestine > Gaza Strip
- Gaza Governorate > Gaza (0.04)
- Japan
- Kyūshū & Okinawa > Kyūshū
- Fukuoka Prefecture > Fukuoka (0.04)
- Honshū > Chūgoku
- Okayama Prefecture > Okayama (0.04)
- Kyūshū & Okinawa > Kyūshū
- India
- Telangana > Hyderabad (0.04)
- Maharashtra > Mumbai (0.04)
- China
- Africa
- Lesotho (0.04)
- Sierra Leone (0.04)
- Middle East > Morocco (0.04)
- Ghana (0.04)
- Equatorial Guinea (0.04)
- Nigeria > Federal Capital Territory
- Abuja (0.04)
- South America > Brazil
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Banking & Finance (1.00)
- Government (0.68)
- Information Technology > Services (0.67)
- Consumer Products & Services > Travel (0.46)
- Education > Educational Setting
- Online (0.68)
- Technology: