Investigating Deep Neural Transformations for Spectrogram-based Musical Source Separation

Choi, Woosung, Kim, Minseok, Chung, Jaehwa, Lee, Daewon, Jung, Soonyoung

Dec-9-2019–arXiv.org Machine Learning

Musical Source Separation (MSS) is a signal processing task that separates a mixed musical signal into its individual acoustic sources, such as singing voice or drums. Many machine learning-based methods have recently been proposed for MSS, but no existing work evaluates and directly compares the various types of networks. In this paper, we design a variety of neural transformation methods, including time-invariant methods, time-frequency methods, and mixtures of two different transformations. Our experiments compare these transformation methods and provide material for future work. We train our models on raw complex-valued STFT outputs and achieve state-of-the-art SDR performance on the MUSDB singing voice separation task, by a large margin of 1.0 dB.

1 Introduction

Given a mixed musical signal composed of several instrumental sounds, Musical Source Separation (MSS) separates the mixture into its individual acoustic sources, such as singing voice or drums.
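To make two terms from the abstract concrete, the following is a minimal NumPy sketch of a complex-valued STFT whose real and imaginary parts are stacked as two input channels, together with a simplified signal-to-distortion ratio. It is an illustration under stated assumptions, not the authors' pipeline: the function names, the FFT size, and the hop length are hypothetical, and the SDR here is the plain energy ratio rather than the BSS Eval variant normally reported for MUSDB.

```python
import numpy as np

def stft_complex(x, n_fft=1024, hop=256):
    """Complex STFT with a Hann window (n_fft and hop are assumed values).

    Returns an array of shape (frames, n_fft // 2 + 1).
    """
    window = np.hanning(n_fft)
    n_frames = 1 + (len(x) - n_fft) // hop
    frames = np.stack(
        [x[i * hop : i * hop + n_fft] * window for i in range(n_frames)]
    )
    return np.fft.rfft(frames, axis=-1)

def to_real_channels(spec):
    """Stack real and imaginary parts as two channels: (2, frames, bins)."""
    return np.stack([spec.real, spec.imag], axis=0)

def simple_sdr(reference, estimate, eps=1e-10):
    """Simplified SDR in dB: reference energy over error energy.

    This is NOT the BSS Eval SDR; it only illustrates the idea.
    """
    num = np.sum(reference ** 2)
    den = np.sum((reference - estimate) ** 2)
    return 10.0 * np.log10((num + eps) / (den + eps))

# Toy signal: a 440 Hz sine, 4096 samples at an assumed 16 kHz rate.
t = np.arange(4096) / 16000.0
source = np.sin(2 * np.pi * 440.0 * t)

spec = stft_complex(source)
features = to_real_channels(spec)

# An estimate that is 10% too loud leaves an error of 0.1 * source.
sdr_db = simple_sdr(source, 1.1 * source)
```

A network operating on `features` sees both magnitude and phase information, which is the practical difference from magnitude-only spectrogram inputs.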

frequency resolution, spectrogram, transformation
