Interactive Concept Learning for Uncovering Latent Themes in Large Text Collections
Maria Leonor Pacheco, Tunazzina Islam, Lyle Ungar, Ming Yin, Dan Goldwasser
arXiv.org Artificial Intelligence
Experts across diverse disciplines are often interested in making sense of large text collections. Traditionally, this challenge is approached either by noisy unsupervised techniques such as topic models, or by following a manual theme discovery process. In this paper, we expand the definition of a theme to account for more than just a word distribution, and include generalized concepts deemed relevant by domain experts. Then, we propose an interactive framework that receives and encodes expert feedback at different levels of abstraction. Our framework strikes a balance between automation and manual coding, allowing experts to maintain control of their study while reducing the manual effort required.
May-8-2023
- Country:
  - North America
    - Dominican Republic (0.04)
    - Mexico (0.04)
    - Honduras (0.04)
    - United States
      - Virginia (0.04)
      - Pennsylvania (0.04)
      - Minnesota > Hennepin County
        - Minneapolis (0.14)
      - Oregon > Multnomah County
        - Portland (0.04)
      - Illinois > Cook County
        - Chicago (0.04)
      - Washington > King County
        - Seattle (0.04)
      - Massachusetts
        - Suffolk County > Boston (0.04)
        - Middlesex County > Cambridge (0.04)
      - Colorado > Boulder County
        - Boulder (0.04)
      - New York > New York County
        - New York City (0.04)
    - Canada > British Columbia
  - Europe
    - United Kingdom
      - Scotland > City of Edinburgh
        - Edinburgh (0.04)
      - England > Cambridgeshire
        - Cambridge (0.04)
    - Sweden > Vaestra Goetaland
      - Gothenburg (0.04)
    - Czechia > South Moravian Region
      - Brno (0.04)
  - Asia
    - Middle East > Jordan (0.05)
    - South Korea (0.04)
    - China
- Genre:
  - Research Report (1.00)
- Industry:
- Technology: