Reverb: Open-Source ASR and Diarization from Rev

Bhandari, Nishchal, Chen, Danny, Fernández, Miguel Ángel del Río, Delworth, Natalie, Fox, Jennifer Drexler, Jetté, Migüel, McNamara, Quinten, Miller, Corey, Novotný, Ondřej, Profant, Ján, Qin, Nan, Ratajczak, Martin, Robichaud, Jean-Philippe

Oct-4-2024–arXiv.org Artificial Intelligence

Today, we are open-sourcing our core speech recognition and diarization models for non-commercial use. We are releasing both a full production pipeline for developers as well as pared-down research models for experimentation. Rev hopes that these releases will spur research and innovation in the fast-moving domain of voice technology. The speech recognition models released today outperform all existing open source speech recognition models across a variety of long-form speech recognition domains.

open-source asr and diarization, rev, speech recognition, (10 more...)

arXiv.org Artificial Intelligence

Oct-4-2024

arXiv.org PDF

Add feedback

Genre:
- Research Report (0.51)

Technology:
- Information Technology > Artificial Intelligence
  - Speech > Speech Recognition (1.00)
  - Machine Learning (1.00)
  - Natural Language (0.96)