Usage of speaker embeddings for more inclusive speech-to-text

Jun-26-2020, 11:17:52 GMT–AIHub

English is one of the most widely used languages worldwide, with approximately 1.2 billion speakers. In order to maximise the performance of speech-to-text systems it is vital to build them in a way that recognises different accents. Recently, spoken dialogue systems have been incorporated into various devices such as smartphones, call services, and navigation systems. These intelligent agents can assist users in performing daily tasks such as booking tickets, setting-up calendar items, or finding restaurants via spoken interaction. They have the potential to be more widely used in a vast range of applications in the future, especially in the education, government, healthcare, and entertainment sectors.

artificial intelligence, machine learning, natural language, (18 more...)

AIHub

Jun-26-2020, 11:17:52 GMT

News Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Speech > Speech Recognition (0.76)
  - Natural Language > Discourse & Dialogue (0.55)
  - Machine Learning > Neural Networks (0.50)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found