Dramatic Conversation Disentanglement
Chang, Kent K., Chen, Danica, Bamman, David
–arXiv.org Artificial Intelligence
We present a new dataset for studying conversation disentanglement in movies and TV series. While previous work has focused on conversation disentanglement in IRC chatroom dialogues, movies and TV shows provide a space for studying complex pragmatic patterns of floor and topic change in face-to-face multi-party interactions. In this work, we draw on theoretical research in sociolinguistics, sociology, and film studies to operationalize a conversational thread (including the notion of a floor change) in dramatic texts, and use that definition to annotate a dataset of 10,033 dialogue turns (comprising 2,209 threads) from 831 movies. We compare the performance of several disentanglement models on this dramatic dataset, and apply the best-performing model to disentangle 808 movies. We see that, contrary to expectation, average thread lengths do not decrease significantly over the past 40 years, and characters portrayed by actors who are women, while underrepresented, initiate more new conversational threads relative to their speaking time.
arXiv.org Artificial Intelligence
May-26-2023
- Country:
- North America
- United States
- Indiana (0.04)
- Connecticut (0.04)
- Illinois (0.04)
- Colorado > Denver County
- Denver (0.04)
- Ohio > Franklin County
- Columbus (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Oregon > Multnomah County
- Portland (0.04)
- Washington > King County
- Seattle (0.04)
- California
- San Diego County > San Diego (0.04)
- Alameda County > Berkeley (0.04)
- New York > New York County
- New York City (0.14)
- Canada > British Columbia
- United States
- Europe
- Italy (0.04)
- United Kingdom > England
- Oxfordshire > Oxford (0.04)
- Greater London > London (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Asia
- Vietnam > Long An Province (0.04)
- Singapore (0.04)
- China > Hong Kong (0.04)
- South Korea > Seoul
- Seoul (0.04)
- Middle East
- Jordan (0.04)
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.04)
- Japan > Honshū
- Kantō > Kanagawa Prefecture > Yokohama (0.04)
- India > Karnataka
- Bengaluru (0.04)
- North America
- Genre:
- Research Report (0.82)
- Industry:
- Leisure & Entertainment (1.00)
- Education (0.92)
- Media
- Television (1.00)
- Film (1.00)
- Technology: