Facebook releases low-latency online speech recognition framework

Jan-16-2020, 14:40:44 GMT–#artificialintelligence

Facebook AI Research (FAIR) today said it's open-sourcing wav2letter@anywhere, a deep learning-based inference framework that achieves fast performance for online automatic speech recognition in cloud or embedded edge environments. Wav2letter@anywhere is based on neural net-based language models wav2letter and wav2letter, which upon its release in December 2018, FAIR called the fastest open source speech recognition system available. Automatic speech recognition, or ASR, is used to turn audio of spoken words into text, then infer the speaker's intent in order to carry out a task. An API available on GitHub though the wav2letter repository is built to support concurrent audio streams and popular kinds of deep learning speech recognition models like convolutional neural networks (CNN) or recurrent neural networks (RNN) in order to deliver scale necessary for online ASR. Wav2letter@anywhere achieves better word error rate performance than two baseline models made from bidirectional LSTM RNNs, according to a paper released last week by eight FAIR researchers from labs in New York City and at company headquarters in Menlo Park.

facebook release, low-latency online speech recognition framework, speech recognition, (10 more...)

#artificialintelligence

Jan-16-2020, 14:40:44 GMT

News Web Page

Add feedback

Country:
- North America > United States > New York (0.26)

Technology:
- Information Technology > Artificial Intelligence
  - Speech > Speech Recognition (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found