Spoken Conversational Agents with Large Language Models
Yang, Chao-Han Huck, Stolcke, Andreas, Heck, Larry
–arXiv.org Artificial Intelligence
Building on this, we will examine joint text-speech pre-training (Chiu et al., 2022; Bar-rault et al., 2023; Chen et al., 2022) methods, This section will provide a comprehensive look at how state-of-the-art voice-interfaced LLMs (Reid et al., 2024; Chu et al., Current Trends The current work in AI virtual assistants builds upon the voice-only systems of the last decade by leveraging LLMs to significantly improve the coverage and robustness of the spoken language understanding and dialogue state tracking components, in addition to substantial advancements in spoken language generation. It highlights recent advancements in multi-turn dialogue systems, encompassing both LLM-based open-domain dialogue (ODD) and task-oriented dialogue (TOD) systems, as well as relevant datasets and evaluation metrics.
arXiv.org Artificial Intelligence
Dec-3-2025
- Country:
- Asia
- North America
- Canada > Ontario
- Toronto (0.04)
- United States
- California > Santa Clara County
- Palo Alto (0.04)
- Florida > Miami-Dade County
- Miami (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- California > Santa Clara County
- Canada > Ontario
- Genre:
- Instructional Material > Course Syllabus & Notes (1.00)
- Overview (0.95)
- Industry:
- Education > Educational Setting (0.46)
- Technology: