Overview of the Amphion Toolkit (v0.2)

Li, Jiaqi, Zhang, Xueyao, Wang, Yuancheng, He, Haorui, Wang, Chaoren, Wang, Li, Liao, Huan, Ao, Junyi, Xie, Zeyu, Huang, Yiqiao, Zhang, Junan, Wu, Zhizheng

Feb-11-2025–arXiv.org Artificial Intelligence

Amphion is an open-source toolkit for Audio, Music, and Speech Generation, designed to lower the entry barrier for junior researchers and engineers in these fields. It provides a versatile framework that supports a variety of generation tasks and models. In this report, we introduce Amphion v0.2, the second major release developed in 2024. This release features a 100K-hour open-source multilingual dataset, a robust data preparation pipeline, and novel models for tasks such as text-to-speech, audio coding, and voice conversion. Furthermore, the report includes multiple tutorials that guide users through the functionalities and usage of the newly released models.

large language model, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

Feb-11-2025

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Pennsylvania > Philadelphia County
    - Philadelphia (0.04)
  - New York > New York County
    - New York City (0.04)
  - Michigan > Washtenaw County
    - Ann Arbor (0.04)
  - California > Santa Clara County
    - Sunnyvale (0.04)
- Europe
  - Spain > Catalonia
    - Barcelona Province > Barcelona (0.04)
  - Italy > Calabria
    - Catanzaro Province > Catanzaro (0.04)
  - Finland > Uusimaa
    - Helsinki (0.04)
  - Czechia > South Moravian Region
    - Brno (0.04)
- Asia > China
  - Liaoning Province > Shenyang (0.04)

Genre:
- Research Report > Promising Solution (0.33)

Industry:
- Media (1.00)
- Law > Government & the Courts (0.67)
- Leisure & Entertainment (0.67)

Technology:
- Information Technology
  - Data Science (1.00)
  - Artificial Intelligence
    - Vision (1.00)
    - Speech > Speech Recognition (1.00)
    - Representation & Reasoning (1.00)
    - Natural Language
      - Text Processing (1.00)
      - Large Language Model (1.00)
    - Machine Learning > Neural Networks
      - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found