CAT: CRF-based ASR Toolkit

Nov-20-2019–arXiv.org Machine Learning

ABSTRACT In this paper, we present a new open source toolkit for automatic speech recognition (ASR), named CA T (CRF-based ASR Toolkit). A key feature of CA T is discriminative training in the framework of conditional random field (CRF), particularly with connectionist temporal classification (CTC) inspired state topology. CA T contains a full-fledged implementation of CTC-CRF and provides a complete workflow for CRF-based end-to-end speech recognition. Evaluation results on Chinese and English benchmarks such as Switchboard and Aishell show that CA T obtains the state-of-the-art results among existing end-to-end models with less parameters, and is competitive compared with the hybrid DNN-HMM models. Towards flexibility, we show that i-vector based speaker-adapted recognition and latency control mechanism can be explored easily and effectively in CA T. We hope CA T, especially the CRF-based framework and software, will be of broad interest to the community, and can be further explored and improved. Index T erms-- speech recognition, open source toolkit, conditional random field, end-to-end 1. INTRODUCTION In addition to theories and algorithms, open source toolkits make substantial contributions to automatic speech recognition (ASR) technologies.

neural network, recognition, speech recognition, (17 more...)

arXiv.org Machine Learning

Nov-20-2019

arXiv.org PDF

Add feedback

Country:
- Asia > China (0.04)
- South America > Chile
  - Santiago Metropolitan Region > Santiago Province > Santiago (0.04)

Genre:
- Research Report (0.64)

Technology:
- Information Technology > Artificial Intelligence
  - Speech > Speech Recognition (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found