End-to-End Speech Recognition: A Survey
Prabhavalkar, Rohit, Hori, Takaaki, Sainath, Tara N., Schlüter, Ralf, Watanabe, Shinji
–arXiv.org Artificial Intelligence
Within components (models, knowledge sources) of an ASR system the classical approach, deep learning has been introduced before coming to a decision. This is in line with Bayes' to acoustic and language modeling. In acoustic modeling, decision rule, which exactly requires a single global decision deep learning replaced Gaussian mixture distributions (hybrid integrating all available knowledge sources. HMM [3], [4]) or augmented the acoustic feature set c) Joint Training: In terms of model training, E2E suggests (nonlinear disciminant/tandem approach [5], [6]). In language estimating all parameters of all components of a model modeling, deep learning replaced count-based approaches [7], jointly using a single objective function that is consistent with [8], [9]. However, when introducing deep learning, the classical the task at hand, which in case of ASR means minimizing the ASR architecture was not yet touched. Classical stateof-the-art expected word error rate. ASR systems today are composed of many separate d) Training Data: Joint training of an integrated model components and knowledge sources, especially speech signal implies using a single kind of training data, which in case preprocessing, methods for robustness w.r.t.
arXiv.org Artificial Intelligence
Mar-2-2023
- Country:
- Asia
- Europe (1.00)
- North America
- Canada (1.00)
- United States > California (0.67)
- Oceania > Australia (0.67)
- Genre:
- Overview (1.00)
- Research Report (1.00)
- Industry:
- Education > Educational Setting (0.67)
- Technology:
- Information Technology > Artificial Intelligence
- Cognitive Science (1.00)
- Machine Learning
- Learning Graphical Models
- Directed Networks > Bayesian Learning (0.45)
- Undirected Networks > Markov Models (0.46)
- Neural Networks > Deep Learning (1.00)
- Learning Graphical Models
- Representation & Reasoning (1.00)
- Speech > Speech Recognition (1.00)
- Information Technology > Artificial Intelligence