Multiple-Instance, Cascaded Classification for Keyword Spotting in Narrow-Band Audio
AbdulKader, Ahmad, Nassar, Kareem, El-Geish, Mohamed, Galvez, Daniel, Patil, Chetan
–arXiv.org Artificial Intelligence
We propose using cascaded classifiers for a keyword spotting (KWS) task on narrow-band (NB), 8kHz audio acquired in non-IID environments -- a more challenging task than most state-of-the-art KWS systems face. We present a model that incorporates Deep Neural Networks (DNNs), cascading, multiple-feature representations, and multiple-instance learning. The cascaded classifiers handle the task's class imbalance and reduce power consumption on computationally-constrained devices via early termination. The KWS system achieves a false negative rate of 6% at an hourly false positive rate of 0.75
arXiv.org Artificial Intelligence
Apr-28-2025
- Country:
- Asia > Singapore (0.04)
- Europe
- Germany > Saxony
- Dresden (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Germany > Saxony
- North America > United States
- California > Los Angeles County > Long Beach (0.04)
- Genre:
- Research Report (0.50)
- Technology: