End-to-end Continuous Speech Recognition using Attention-based Recurrent NN: First Results

Chorowski, Jan, Bahdanau, Dzmitry, Cho, Kyunghyun, Bengio, Yoshua

Dec-4-2014–arXiv.org Machine Learning

Dzmitry Bahdanau Jacobs University Bremen, Germany Yoshua Bengio Université de Montréal CIFAR Senior Fellow We replace the Hidden Markov Model (HMM) which is traditionally used in in continuous speech recognition with a bidirectional recurrent neural network encoder coupled to a recurrent neural network decoder that directly emits a stream of phonemes. The alignment between the input and output sequences is established using an attention mechanism: the decoder emits each symbol based on a context created with a subset of input symbols selected by the attention mechanism. We report initial results demonstrating that this new approach achieves phoneme error rates that are comparable to the state-of-the-art HMM-based decoders, on the TIMIT dataset.

artificial intelligence, machine learning, sequence, (18 more...)

arXiv.org Machine Learning

Dec-4-2014

arXiv.org PDF

Add feedback

Country:
- North America > Canada
  - Quebec > Montreal (0.24)
- Europe > Germany
  - Bremen > Bremen (0.24)

Genre:
- Research Report > New Finding (0.48)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Neural Networks > Deep Learning (1.00)
  - Learning Graphical Models > Undirected Networks
    - Markov Models (0.89)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found