A Topical Approach to Capturing Customer Insight In Social Media
–arXiv.org Artificial Intelligence
The age of social media has opened new opportunities for businesses. This flourishing wealth of information lies outside the traditional channels and frameworks of classical marketing research, including Marketing Mix Modeling (MMM). Textual data, in particular, poses many challenges that data analysis practitioners must tackle. Social media constitute massive, heterogeneous, and noisy document sources. Industrial data acquisition processes include some amount of ETL (extract, transform, load). However, the variability of noise in the data and the heterogeneity induced by different sources create the need for ad hoc tools. In other words, extracting customer insight in fully unsupervised, noisy contexts is an arduous task. This research addresses the challenge of fully unsupervised topic extraction in noisy, Big Data contexts. We present three approaches built on the Variational Autoencoder framework: the Embedded Dirichlet Process, the Embedded Hierarchical Dirichlet Process, and the time-aware Dynamic Embedded Dirichlet Process. These approaches are nonparametric with respect to topics and have the particularity of determining both word embeddings and topic embeddings. These embeddings do not require transfer learning, but knowledge transfer remains possible. We test these approaches on benchmark datasets and on automotive industry-related datasets from a real-world use case. We show that our models achieve performance equal to or better than state-of-the-art methods and that the field of topic modeling would benefit from improved evaluation metrics.
Jul-14-2023
- Country:
  - South America
    - Chile > Santiago Metropolitan Region
      - Santiago Province > Santiago (0.04)
    - Brazil > Rio de Janeiro
      - Rio de Janeiro (0.04)
  - North America
    - United States
      - District of Columbia > Washington (0.04)
      - Pennsylvania
        - Philadelphia County > Philadelphia (0.04)
        - Allegheny County > Pittsburgh (0.04)
      - Oregon > Multnomah County
        - Portland (0.04)
      - New York > New York County
        - New York City (0.04)
      - Florida > Broward County
        - Fort Lauderdale (0.04)
      - California > Alameda County
        - Berkeley (0.04)
    - Canada
      - Quebec > Montreal (0.04)
      - British Columbia > Metro Vancouver Regional District
        - Vancouver (0.04)
  - Europe
    - United Kingdom > Scotland
      - City of Edinburgh > Edinburgh (0.04)
    - Sweden > Vaestra Goetaland
      - Gothenburg (0.04)
    - Spain > Catalonia
      - Barcelona Province > Barcelona (0.04)
    - Slovakia > Žilina
      - Žilina (0.04)
    - Romania > București - Ilfov Development Region
      - Municipality of Bucharest > Bucharest (0.04)
    - Germany
      - Berlin (0.04)
      - Brandenburg > Potsdam (0.04)
    - France > Auvergne-Rhône-Alpes
    - Finland > Uusimaa
      - Helsinki (0.04)
  - Asia
    - Cambodia (0.14)
    - Bangladesh (0.04)
    - Middle East
      - Jordan (0.04)
      - Israel > Southern District (0.04)
    - Japan > Honshū
      - Kansai > Osaka Prefecture > Osaka (0.04)
    - India > Telangana
      - Hyderabad (0.04)
    - China
  - Africa
    - Zimbabwe (0.04)
    - Namibia (0.04)
    - Mozambique (0.04)
    - Middle East > Morocco (0.04)
    - Burundi (0.04)
    - Angola (0.04)
    - Zambia > Lusaka Province
      - Lusaka (0.04)
- Genre:
  - Research Report
    - New Finding (0.45)
    - Promising Solution (0.34)
- Industry:
  - Government (0.92)
  - Transportation > Ground (0.45)
  - Automobiles & Trucks > Manufacturer (0.45)
- Technology: