Modeling Real-Time Interactive Conversations as Timed Diarized Transcripts
Tanzer, Garrett, Ahdritz, Gustaf, Melas-Kyriazi, Luke
–arXiv.org Artificial Intelligence
Chatbots built upon language models have exploded in popularity, but they have largely been limited to synchronous, turn-by-turn dialogues. In this paper we present a simple yet general method to simulate real-time interactive conversations using pretrained text-only language models, by modeling timed diarized transcripts and decoding them with causal rejection sampling. We demonstrate the promise of this method with two case studies: instant messenger dialogues and spoken conversations, which require generation at about 30 tok/s and 20 tok/s respectively to maintain real-time interactivity. These capabilities can be added into language models using relatively little data and run on commodity hardware.
arXiv.org Artificial Intelligence
May-21-2024
- Country:
- Asia > Japan
- Honshū (0.14)
- North America > United States (0.46)
- Asia > Japan
- Genre:
- Research Report (0.50)
- Industry:
- Health & Medicine (0.67)
- Information Technology (0.92)
- Technology: