Reverb: Open-Source ASR and Diarization from Rev
Bhandari, Nishchal; Chen, Danny; Fernández, Miguel Ángel del Río; Delworth, Natalie; Fox, Jennifer Drexler; Jetté, Migüel; McNamara, Quinten; Miller, Corey; Novotný, Ondřej; Profant, Ján; Qin, Nan; Ratajczak, Martin; Robichaud, Jean-Philippe
arXiv.org Artificial Intelligence
Today, we are open-sourcing our core speech recognition and diarization models for non-commercial use. We are releasing both a full production pipeline for developers and pared-down research models for experimentation. We hope that these releases will spur research and innovation in the fast-moving domain of voice technology. The speech recognition models released today outperform all existing open-source speech recognition models across a variety of long-form speech recognition domains.
Oct-4-2024
- Genre:
- Research Report (0.51)
- Technology:
- Information Technology > Artificial Intelligence
- Speech > Speech Recognition (1.00)
- Machine Learning (1.00)
- Natural Language (0.96)