Beyond Turn-taking: Introducing Text-based Overlap into Human-LLM Interactions

Kim, JiWoo, Chang, Minsuk, Bak, JinYeong

arXiv.org Artificial Intelligence

Traditional text-based human-AI interactions often adhere to a strict turn-taking approach. In this research, we propose a novel approach that incorporates overlapping messages, mirroring natural human conversations. Through a formative study, we observed that even in text-based contexts, users instinctively engage in overlapping behaviors like "A: Today I went to-" "B: yeah." To capitalize on these insights, we developed OverlapBot, a prototype chatbot where both AI and users can initiate overlapping. Our user study revealed that OverlapBot was perceived as more communicative and immersive than a traditional turn-taking chatbot, fostering faster and more natural interactions. Our findings contribute to the understanding of the design space for overlapping interactions. We also provide recommendations for implementing overlap-capable AI interactions to enhance the fluidity and engagement of text-based conversations.


Turn-taking and Backchannel Prediction with Acoustic and Large Language Model Fusion

Wang, Jinhan, Chen, Long, Khare, Aparna, Raju, Anirudh, Dheram, Pranav, He, Di, Wu, Minhua, Stolcke, Andreas, Ravichandran, Venkatesh

arXiv.org Artificial Intelligence

We propose an approach for continuous prediction of turn-taking and backchanneling locations in spoken dialogue by fusing a neural acoustic model with a large language model (LLM). Experiments on the Switchboard human-human conversation dataset demonstrate that our approach consistently outperforms single-modality baseline models. We also develop a novel multi-task instruction fine-tuning strategy to further benefit from LLM-encoded knowledge for understanding the tasks and conversational contexts, leading to additional improvements. Our approach demonstrates the potential of combined LLMs and acoustic models for a more natural and conversational interaction between humans and speech-enabled AI agents.
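
The fusion idea above can be sketched as a late-fusion classifier: concatenate an acoustic feature vector with an LLM-derived text embedding and apply a linear layer plus softmax over the turn-taking labels. This is a minimal illustrative sketch, not the paper's actual model; the feature names, dimensions, labels, and weights are all assumptions.

```python
# Hedged sketch: late fusion of acoustic features and an LLM text
# embedding for turn-taking / backchannel prediction. All dimensions,
# labels, and weights are illustrative assumptions.
import math

LABELS = ["hold", "turn-shift", "backchannel"]

def fuse_and_predict(acoustic_feats, text_embedding, weights, bias):
    """Concatenate both modalities, apply a linear layer, then softmax."""
    fused = list(acoustic_feats) + list(text_embedding)
    logits = [
        sum(w * x for w, x in zip(row, fused)) + b
        for row, b in zip(weights, bias)
    ]
    # Numerically stable softmax.
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    probs = [e / total for e in exps]
    return LABELS[probs.index(max(probs))], probs
```

In a real system the weights would be learned and the text embedding would come from the fine-tuned LLM; here a toy call such as `fuse_and_predict([0.2, 0.9], [0.1, -0.3, 0.5], W, b)` just demonstrates the fusion-and-classify shape.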


Turn-Taking with Improvisational Co-Creative Agents

Winston, Lauren (Georgia Institute of Technology) | Magerko, Brian (Georgia Institute of Technology)

AAAI Conferences

Turn-taking is the ability for agents to lead or follow in social interactions. Turn-taking between humans and intelligent agents has been studied in human-robot interaction but has not been applied to improvisational, dance-based interactions. User understanding and experience of turn-taking in an improvisational, dance-based system known as LuminAI was investigated in a preliminary study of 11 participants. The results showed a trend towards users understanding the difference between turn-taking and non-turn-taking versions of LuminAI but reduced user experience in the turn-taking version.


Turn-Taking, Children, and the Unpredictability of Fun

Lehman, Jill Fain (Disney Research) | Leite, Iolanda (Disney Research)

AI Magazine

When the underlying assumptions of commonality of purpose and content break down, the interaction does as well. A great deal of the art of interaction design lies in minimizing what is, from the agent's point of view, out-of-task behavior, both by anticipating natural in-task communication and by providing cues to lead participants down the predicted paths. Anticipation and cueing are particularly important in designing interactions for young children, a population that is limited in its ability to understand and adapt to the bounds of a system when things go awry. Most speech and natural language research that focuses on this population has pedagogy (Ogan et al. 2012; Gordon and Breazeal 2015) or therapy as its focus. As explained briefly by Edith, there are two main game actions: effecting a change to the model by naming one of the clothing items or accessories on the board, and requesting a picture of the increasingly crazily clad model to be printed and taken home afterward. The majority of the interaction consists of 20 choice cycles, during each of which a valid reference to a board item is made, the model changes, and a replacement item appears.


Signalizing and Predicting Turn-Taking in Multilingual Contexts: Using Data from Transcribed International Spoken Journalistic Texts in Human-Robot Interaction

Alexandris, Christina (National University of Athens)

AAAI Conferences

Data from transcribed spoken journalistic texts from international news networks is employed in the signalization and prediction of turn-taking in Human-Computer Interaction and Human-Robot Interaction in multilingual contexts, taking into account the verbal and non-verbal behavior of international speakers.


Turn-Taking in Commander-Robot Navigator Dialog

Cassidy, Taylor (US Army Research Laboratory) | Voss, Clare (US Army Research Laboratory) | Summers-Stay, Douglas (US Army Research Laboratory)

AAAI Conferences

We seek to develop a robot that will be capable of teaming with humans to accomplish physical exploration tasks that would not otherwise be possible in dynamic, dangerous environments. For such tasks, a human commander needs to be able to communicate with a robot that moves out of sight and relays information back to the commander. What is the best way to determine how a human commander would interact in a multi-modal spoken dialog with such a robot to accomplish tasks? In this paper, we describe our initial approach to discovering a principled basis for coordinating turn-taking, perception, and navigational behavior of a robot in communication with a commander, by identifying decision phases in dialogs collected in a WoZ framework. We present two types of utterance annotation with examples applied to task-oriented dialog between a human commander and a human "robot navigator" who controls the physical robot in a realistic environment similar to expected actual conditions. We discuss core robot capabilities that bear on the robot navigator's ability to take turns while performing a "find the building doors" task at hand. The paper concludes with a brief overview of ongoing work to implement these decision phases within an open-source dialog management framework, constructing a task tree specification and dialog control logic for our application domain.
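
The abstract mentions constructing a task tree specification for dialog control. A minimal sketch of what such a structure might look like is below; the node fields, task names, and traversal policy are assumptions for illustration, not the authors' actual specification.

```python
# Hedged sketch of a task-tree node for dialog control, inspired by the
# task tree specification the abstract mentions. Field names and the
# depth-first "next pending task" policy are illustrative assumptions.
from dataclasses import dataclass, field

@dataclass
class TaskNode:
    name: str
    children: list = field(default_factory=list)
    done: bool = False

    def next_pending(self):
        """Depth-first search for the first incomplete leaf task."""
        if not self.children:
            return None if self.done else self
        for child in self.children:
            leaf = child.next_pending()
            if leaf is not None:
                return leaf
        return None
```

A dialog manager could use `next_pending()` to decide which subtask to ground the next commander-robot exchange in, e.g. for a hypothetical "find the building doors" mission decomposed into "approach-building" and "scan-facade" subtasks.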


Turn-Taking in Commander-Robot Navigator Dialog (Video Abstract)

Cassidy, Taylor (US Army Research Laboratory) | Voss, Clare (US Army Research Laboratory) | Summers-Stay, Douglas (US Army Research Laboratory)

AAAI Conferences

The accompanying video captures the multi-modal data displays and speech dialogue of a human Commander (C) and a human Robot Navigator (RN) tele-operating a mobile robot (R) in a remote, previously unexplored area. We describe unique challenges for automation of turn-taking and coordination processes observed in the data.


On the Challenges and Opportunities of Physically Situated Dialog

Bohus, Dan (Microsoft Research) | Horvitz, Eric (Microsoft Research)

AAAI Conferences

We outline several challenges and opportunities for building physically situated systems that can interact in open, dynamic, and relatively unconstrained environments. We review a platform and recent progress on developing computational methods for situated, multiparty, open-world dialog, and highlight the value of representations of the physical surroundings and of harnessing the broader situational context when managing communicative processes such as engagement, turn taking, language understanding, and dialog management. Finally, we outline an open-world learning challenge that spans these different levels.


Turn Taking for Human-Robot Interaction

Chao, Crystal (Georgia Institute of Technology) | Thomaz, Andrea Lockerd (Georgia Institute of Technology)

AAAI Conferences

Applications in Human-Robot Interaction (HRI) in the not-so-distant future include robots that collaborate with factory workers or serve us as caregivers or waitstaff. When offering customized functionality in these dynamic environments, robots need to engage in real-time exchanges with humans. Robots thus need to be capable of participating in smooth turn-taking interactions. The HRI research goal of unstructured dialogic interaction is communication with robots that is as natural as communication with other humans. Turn-taking is the framework that provides structure for human communication. Consciously or subconsciously, humans are able to communicate their understanding and control of the turn structure to a conversation partner by using syntax, semantics, paralinguistic cues, eye gaze, and body language in a socially intelligent way. Our research aims to show that by implementing these turn-taking cues within an interaction architecture that is designed fundamentally for turn-taking, a robot becomes easier and more efficient for a human to interact with. This paper outlines our approach and initial pilot study into this line of research.