Implicit Acoustic Echo Cancellation for Keyword Spotting and Device-Directed Speech Detection

Samuele Cornell, Thomas Balestri, Thibaud Sénéchal

Oct-4-2022–arXiv.org Artificial Intelligence

In these instances, the performance of tasks such as keyword spotting (KWS) and device-directed speech detection (DDD) can degrade significantly. To address this problem, we propose an implicit acoustic echo cancellation (iAEC) framework in which a neural network is trained to exploit the additional information from a reference microphone channel to learn to ignore the interfering signal and improve detection performance. We study this framework for the tasks of KWS and DDD on, respectively, an augmented version of Google Speech Commands v2 and a real-world Alexa device dataset.

... are increasingly realistic or include celebrity-derived custom voices. This can lead to the device "self waking" and continuously interrupting itself, as the model alone cannot implicitly distinguish between user and device speech and ignore the latter. This problem also affects automatic speech recognition (ASR) or keyword-less initiated interactions, such as device-directed detection (DDD) [7-10]. One trivial way to mitigate this issue would be to disable the KWS functionality while the device is in playback. Yet doing so prevents the user from "barging in", making the interaction significantly less ...
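The abstract describes a network that receives a reference channel alongside the microphone signal so it can learn to ignore device playback. The text here does not say how the two channels are combined, so the following is only a minimal sketch under an assumed fusion strategy: per-frame log-spectrogram features of both channels are concatenated into one input matrix for a downstream KWS or DDD classifier. The function names, the frame settings (25 ms window, 10 ms hop at 16 kHz), and the synthetic noise signals are all hypothetical and are not taken from the paper.

```python
import numpy as np


def log_spectrogram(signal, frame_len=400, hop=160):
    """Log-magnitude STFT features, shape (num_frames, frame_len // 2 + 1)."""
    num_frames = 1 + (len(signal) - frame_len) // hop
    window = np.hanning(frame_len)
    # Slice the signal into overlapping, windowed frames.
    frames = np.stack([
        signal[i * hop : i * hop + frame_len] * window
        for i in range(num_frames)
    ])
    # Small offset avoids log(0) on silent bins.
    return np.log(np.abs(np.fft.rfft(frames, axis=1)) + 1e-6)


def stack_mic_and_reference(mic, reference):
    """Concatenate per-frame features of both channels.

    ASSUMPTION: feature concatenation is one plausible way to expose the
    reference to the network; the paper's actual fusion may differ.
    """
    n = min(len(mic), len(reference))
    mic_feats = log_spectrogram(mic[:n])
    ref_feats = log_spectrogram(reference[:n])
    return np.concatenate([mic_feats, ref_feats], axis=1)


# Synthetic stand-ins (white noise), one second at 16 kHz.
rng = np.random.default_rng(0)
sample_rate = 16000
playback = rng.standard_normal(sample_rate)     # device playback (reference)
user_speech = rng.standard_normal(sample_rate)  # stand-in for the user's voice
# Simplification: a real echo path would filter the playback through the room.
mic = user_speech + 0.5 * playback

features = stack_mic_and_reference(mic, playback)
print(features.shape)
```

With an input of this form, a classifier trained on labels that depend only on the user's speech has the information it needs to discount whatever in the microphone features is explained by the reference, which is the "implicit" part of the idea: no separate echo-cancellation stage is run beforehand.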

artificial intelligence, machine learning, playback

Country:
- North America > United States (0.04)
- Europe > Italy
  - Marche (0.04)

Genre:
- Research Report (0.64)

Technology:
- Information Technology > Artificial Intelligence
  - Speech (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.46)
