Semantic-Driven Topic Modeling for Analyzing Creativity in Virtual Brainstorming
Mersha, Melkamu Abay, Kalita, Jugal
–arXiv.org Artificial Intelligence
Virtual brainstorming sessions have become a central component of collaborative problem solving, yet the large volume and uneven distribution of ideas often make it difficult to extract valuable insights efficiently. Manual coding of ideas is time-consuming and subjective, underscoring the need for automated approaches to support the evaluation of group creativity. In this study, we propose a semantic-driven topic modeling framework that integrates four modular components: transformer-based embeddings (Sentence-BERT), dimensionality reduction (UMAP), clustering (HDBSCAN), and topic extraction with refinement. We evaluate our approach on structured Zoom brainstorming sessions involving student groups tasked with improving their university. Results demonstrate that our model achieves higher topic coherence compared to established methods such as LDA, ETM, and BERTopic, with an average coherence score of 0.687 (CV), outperforming baselines by a significant margin. Beyond improved performance, the model provides interpretable insights into the depth and diversity of topics explored, supporting both convergent and divergent dimensions of group creativity. This work highlights the potential of embedding-based topic modeling for analyzing collaborative ideation and contributes an efficient and scalable framework for studying creativity in synchronous virtual meetings. Introduction Digital communication has become central to modern collaboration, with virtual meetings and conversational agents such as chatbots increasingly shaping how teams interact across geographical and cultural boundaries [1].
arXiv.org Artificial Intelligence
Sep-23-2025
- Country:
- Asia (0.46)
- North America > United States
- Colorado (0.14)
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Education (0.48)
- Information Technology (0.46)
- Technology:
- Information Technology
- Data Science > Data Mining (1.00)
- Communications
- Social Media (1.00)
- Collaboration (1.00)
- Artificial Intelligence
- Representation & Reasoning (1.00)
- Natural Language > Text Processing (1.00)
- Machine Learning
- Statistical Learning > Clustering (0.47)
- Neural Networks > Deep Learning (0.35)
- Information Technology