When Large Language Models Meet Speech: A Survey on Integration Approaches

Yang, Zhengdong, Shimizu, Shuichiro, Yu, Yahan, Chu, Chenhui

Feb-26-2025–arXiv.org Artificial Intelligence

Recent advancements in large language models (LLMs) have spurred interest in expanding their application beyond text-based tasks. A large number of studies have explored integrating other modalities with LLMs, notably speech modality, which is naturally related to text. This paper surveys the integration of speech with LLMs, categorizing the methodologies into three primary approaches: text-based, latent-representation-based, and audio-token-based integration.

arxiv preprint arxiv, international conference, language model, (11 more...)

arXiv.org Artificial Intelligence

Feb-26-2025

arXiv.org PDF

Add feedback

Country:
- North America
  - United States
    - Rhode Island (0.04)
    - Massachusetts > Middlesex County
      - Cambridge (0.04)
    - Louisiana > Orleans Parish
      - New Orleans (0.04)
    - Hawaii > Honolulu County
      - Honolulu (0.04)
    - Florida > Miami-Dade County
      - Miami (0.04)
    - California > San Diego County
      - San Diego (0.04)
  - Mexico > Mexico City
    - Mexico City (0.04)
  - Canada
    - Ontario > Toronto (0.04)
    - British Columbia > Vancouver (0.04)
- Europe
  - Austria > Vienna (0.14)
  - Greece (0.04)
  - Ireland > Leinster
    - County Dublin > Dublin (0.04)
- Asia
  - Singapore (0.04)
  - Thailand > Bangkok
    - Bangkok (0.04)
  - South Korea
    - Seoul > Seoul (0.04)
    - Incheon > Incheon (0.04)
    - Gyeonggi-do > Suwon (0.04)
  - Japan > Honshū
    - Tōhoku > Iwate Prefecture
      - Morioka (0.04)
    - Kansai > Kyoto Prefecture
      - Kyoto (0.04)
    - Chūbu > Aichi Prefecture
      - Nagoya (0.04)
  - China > Shanghai
    - Shanghai (0.04)

Genre:
- Research Report (1.00)
- Overview (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found