Sound source detection, localization and classification using consecutive ensemble of CRNN models

Aug-2-2019–arXiv.org Machine Learning

Each of these models is a copy of a single SELDnet node with only minor adjustments, made to fit the specific subtask and for regularization. Each model takes as input a fixed-length subsequence of decibel-scale amplitude spectrograms (for the noas and class subtasks) or both decibel-scale amplitude and phase spectrograms (for the doa1 and doa2 subtasks) from all 4 channels. In each case, the input layers are followed by 3 convolutional blocks, each consisting of a convolutional layer, batch normalization, ReLU activation, max pooling, and dropout. The output of the last convolutional block is reshaped into a multivariate sequence of fixed length. For doa2, we additionally concatenate the directions of arrival of the associated events with this multivariate sequence.
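To make the reshaping step concrete, the following is a minimal shape-tracing sketch in plain Python, not the authors' implementation. It assumes that convolutions use "same" padding and that max pooling acts on the frequency axis only, so the number of time frames is preserved. The filter counts, pool sizes, input dimensions, and the names `ConvBlock` and `trace_shapes` are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class ConvBlock:
    out_channels: int  # number of convolutional filters (assumed value)
    freq_pool: int     # max-pool size along the frequency axis (assumed value)


def trace_shapes(n_inputs, n_frames, n_freq_bins, blocks, n_doa_features=0):
    """Return (sequence_length, features_per_step) after the conv stack.

    n_inputs is the number of stacked input spectrograms, e.g. 4 for
    amplitude only, or 8 for amplitude plus phase from 4 channels.
    """
    channels, freq = n_inputs, n_freq_bins
    for block in blocks:
        # Assumption: "same"-padded convolution keeps time and frequency
        # sizes; batch norm, ReLU and dropout never change the shape.
        channels = block.out_channels
        # Assumption: pooling shrinks frequency only, so the sequence
        # length (number of frames) stays fixed.
        freq = freq // block.freq_pool
    # Reshape: merge channel and frequency axes into one feature vector
    # per time frame, giving a multivariate sequence.
    features = channels * freq
    # doa2 only: concatenate directions of arrival of associated events.
    features += n_doa_features
    return n_frames, features


# Hypothetical configuration with 3 convolutional blocks.
blocks = [ConvBlock(64, 8), ConvBlock(64, 8), ConvBlock(64, 4)]

print(trace_shapes(4, 128, 1024, blocks))                    # noas / class
print(trace_shapes(8, 128, 1024, blocks))                    # doa1
print(trace_shapes(8, 128, 1024, blocks, n_doa_features=2))  # doa2
```

Under these assumptions the sequence length equals the number of input frames in every subtask, and the doa2 variant differs from doa1 only in the extra per-step features appended after the reshape.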



