DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech

Mar-12-2023–arXiv.org Artificial Intelligence

The majority of current Text-to-Speech (TTS) datasets, which are collections of individual utterances, contain few conversational aspects. In this paper, we introduce DailyTalk, a high-quality conversational speech dataset designed for conversational TTS. We sampled, modified, and recorded 2,541 dialogues from the open-domain dialogue dataset DailyDialog inheriting its annotated attributes. On top of our dataset, we extend prior work as our baseline, where a non-autoregressive TTS is conditioned on historical information in a dialogue. From the baseline experiment with both general and our novel metrics, we show that DailyTalk can be used as a general TTS dataset, and more than that, our baseline can represent contextual information from DailyTalk. The DailyTalk dataset and baseline code are freely available for academic use with CC-BY-SA 4.0 license.

artificial intelligence, natural language, optical character recognition, (20 more...)

arXiv.org Artificial Intelligence

Mar-12-2023

arXiv.org PDF

Add feedback

Country:
- North America
  - United States (0.04)
  - Canada > Quebec
    - Montreal (0.05)
- Europe > Italy
  - Tuscany > Florence (0.04)
  - Calabria > Catanzaro Province
    - Catanzaro (0.04)
- Asia > Taiwan
  - Taiwan Province > Taipei (0.04)

Genre:
- Research Report (0.64)

Technology:
- Information Technology > Artificial Intelligence
  - Vision > Optical Character Recognition (0.62)
  - Speech > Speech Synthesis (0.62)
  - Natural Language > Discourse & Dialogue (0.50)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found