Librispeech Transducer Model with Internal Language Model Prior Correction

Zeyer, Albert, Merboldt, André, Michel, Wilfried, Schlüter, Ralf, Ney, Hermann

Apr-7-2021–arXiv.org Artificial Intelligence

We present our transducer model on Librispeech. We study variants to include an external language model (LM) with shallow fusion and subtract an estimated internal LM. This is justified by a Bayesian interpretation where the transducer model prior is given by the estimated internal LM. The subtraction of the internal LM gives us over 14% relative improvement over normal shallow fusion. Our transducer has a separate probability distribution for the non-blank labels which allows for easier combination with the external LM, and easier estimation of the internal LM. We additionally take care of including the end-of-sentence (EOS) probability of the external LM in the last blank probability which further improves the performance. All our code and setups are published.

shallow fusion, speech recognition, transducer, (11 more...)

arXiv.org Artificial Intelligence

Apr-7-2021

arXiv.org PDF

Add feedback

Country:
- Oceania > Australia
  - Victoria > Melbourne (0.04)
- North America
  - United States
    - Georgia > Fulton County
      - Atlanta (0.04)
    - California > San Diego County
      - San Diego (0.04)
  - Canada > Quebec
    - Montreal (0.04)
- Europe
  - Germany > North Rhine-Westphalia
    - Cologne Region > Aachen (0.05)
  - Austria > Styria
    - Graz (0.04)
- Asia
  - Singapore (0.04)
  - India > Telangana
    - Hyderabad (0.04)
  - China > Shanghai
    - Shanghai (0.04)

Genre:
- Research Report (0.40)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.99)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found