ConcateNet: Dialogue Separation Using Local And Global Feature Concatenation
Halimeh, Mhd Modar, Torcoli, Matteo, Habets, Emanuël
–arXiv.org Artificial Intelligence
Dialogue separation involves isolating a dialogue signal from a mixture, such as a movie or a TV program. This can be a necessary step to enable dialogue enhancement for broadcast-related applications. In this paper, ConcateNet for dialogue separation is proposed, which is based on a novel approach for processing local and global features aimed at better generalization for out-of-domain signals. ConcateNet is trained using a noise reduction-focused, publicly available dataset and evaluated using three datasets: two noise reduction-focused datasets (in-domain), which show competitive performance for ConcateNet, and a broadcast-focused dataset (out-of-domain), which verifies the better generalization performance for the proposed architecture compared to considered state-of-the-art noise-reduction methods.
arXiv.org Artificial Intelligence
Aug-16-2024
- Country:
- Oceania > Australia
- Queensland > Brisbane (0.04)
- North America
- United States > Rhode Island (0.04)
- Canada
- Ontario > Toronto (0.04)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- Europe
- Greece (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Germany > Bavaria
- Middle Franconia > Nuremberg (0.04)
- Asia
- South Korea > Seoul
- Seoul (0.04)
- Singapore > Central Region
- Singapore (0.04)
- China > Shanghai
- Shanghai (0.04)
- South Korea > Seoul
- Oceania > Australia
- Genre:
- Research Report > Promising Solution (0.34)
- Industry:
- Media (0.48)
- Technology: