Learning the Spectrogram Temporal Resolution for Audio Classification

Oct-6-2022, 14:43:35 GMT–#artificialintelligence

The audio spectrogram is a time-frequency representation that has been widely used for audio classification. The temporal resolution of a spectrogram depends on the hop size. Previous work generally assumes a constant hop size, such as ten milliseconds. However, a fixed hop size, and hence a fixed temporal resolution, is not always optimal for different types of sound. This paper proposes a novel method, DiffRes, that enables differentiable temporal resolution learning to improve the performance of audio classification models.
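To make the link between hop size and temporal resolution concrete, the following is a minimal sketch of the frame-count arithmetic for a fixed-hop spectrogram. It is not the DiffRes method itself. The 16 kHz sample rate, 25 ms window, 10-second clip, and the helper names `ms_to_samples` and `num_frames` are assumptions chosen for illustration and do not come from the abstract.

```python
def ms_to_samples(ms: float, sample_rate: int) -> int:
    """Convert a duration in milliseconds to a whole number of samples."""
    return int(round(ms * sample_rate / 1000))


def num_frames(num_samples: int, win_length: int, hop_length: int) -> int:
    """Count the frames a sliding window yields when the signal is not padded."""
    if num_samples < win_length:
        return 0
    return 1 + (num_samples - win_length) // hop_length


# Assumed illustrative settings (not taken from the paper).
SAMPLE_RATE = 16_000   # samples per second
WINDOW_MS = 25         # analysis window length
CLIP_SECONDS = 10      # clip duration

clip_samples = CLIP_SECONDS * SAMPLE_RATE
win_length = ms_to_samples(WINDOW_MS, SAMPLE_RATE)

# A smaller hop gives more frames per second, i.e. finer temporal resolution.
frames_by_hop = {
    hop_ms: num_frames(clip_samples, win_length, ms_to_samples(hop_ms, SAMPLE_RATE))
    for hop_ms in (5, 10, 20)
}

for hop_ms, frames in frames_by_hop.items():
    print(f"hop = {hop_ms:>2} ms -> {frames} frames")
```

Under these assumptions, halving the hop size roughly doubles the number of frames a classifier must process, which is the cost side of the trade-off that a fixed hop size settles once for every type of sound.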

audio classification, hop size, spectrogram temporal resolution
