The Interspeech Zero Resource Speech Challenge 2021: Spoken language modelling

Dunbar, Ewan, Bernard, Mathieu, Hamilakis, Nicolas, Nguyen, Tu Anh, de Seyssel, Maureen, Rozé, Patricia, Rivière, Morgane, Kharitonov, Eugene, Dupoux, Emmanuel

Apr-29-2021–arXiv.org Artificial Intelligence

We present the Zero Resource Speech Challenge 2021, which asks participants to learn a language model directly from audio, without any text or labels. The challenge is based on the Libri-light dataset, which provides up to 60k hours of audio from English audio books without any associated text. We provide a pipeline baseline system consisting on an encoder based on contrastive predictive coding (CPC), a quantizer ($k$-means) and a standard language model (BERT or LSTM). The metrics evaluate the learned representations at the acoustic (ABX discrimination), lexical (spot-the-word), syntactic (acceptability judgment) and semantic levels (similarity judgment). We present an overview of the eight submitted systems from four groups and discuss the main results.

baseline, representation, zero resource speech challenge 2021, (11 more...)

arXiv.org Artificial Intelligence

Apr-29-2021

arXiv.org PDF

Add feedback

Country:
- Europe > France (0.04)
- North America
  - United States > Minnesota
    - Hennepin County > Minneapolis (0.14)
  - Canada > Ontario
    - Toronto (0.14)

Genre:
- Overview (0.54)
- Research Report (0.40)

Industry:
- Law (0.34)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.72)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found