Biomimetic Frontend for Differentiable Audio Processing

Famularo, Ruolan Leslie, Zotkin, Dmitry N., Shamma, Shihab A., Duraiswami, Ramani

Sep-13-2024–arXiv.org Artificial Intelligence

While models in audio and speech processing are becoming deeper and more end-to-end, they as a consequence need expensive training on large data, and are often brittle. We build on a classical model of human hearing and make it differentiable, so that we can combine traditional explainable biomimetic signal processing approaches with deep-learning frameworks. This allows us to arrive at an expressive and explainable model that is easily trained on modest amounts of data. We apply this model to audio processing tasks, including classification and enhancement. Results show that our differentiable model surpasses black-box approaches in terms of computational efficiency and robustness, even with little training data. We also discuss other potential applications.

frontend, modulation, recognition, (15 more...)

arXiv.org Artificial Intelligence

Sep-13-2024

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - New York (0.04)
  - Maryland > Prince George's County
    - College Park (0.04)
- Europe > United Kingdom
  - England > Cambridgeshire > Cambridge (0.04)

Genre:
- Research Report > New Finding (0.34)

Industry:
- Health & Medicine > Therapeutic Area > Neurology (0.47)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found