Emo-DNA: Emotion Decoupling and Alignment Learning for Cross-Corpus Speech Emotion Recognition
Ye, Jiaxin, Wei, Yujie, Wen, Xin-Cheng, Ma, Chenglong, Huang, Zhizhong, Liu, Kunhong, Shan, Hongming
–arXiv.org Artificial Intelligence
Cross-corpus speech emotion recognition (SER) seeks to generalize the ability of inferring speech emotion from a well-labeled corpus to an unlabeled one, which is a rather challenging task due to the significant discrepancy between two corpora. Existing methods, typically based on unsupervised domain adaptation (UDA), struggle to learn corpus-invariant features by global distribution alignment, but unfortunately, the resulting features are mixed with corpus-specific features or not class-discriminative. To tackle these challenges, we propose a novel Emotion Decoupling aNd Alignment learning framework (EMO-DNA) for cross-corpus SER, a novel UDA method to learn emotion-relevant corpus-invariant features. The novelties of EMO-DNA are two-fold: contrastive emotion decoupling and dual-level emotion alignment. On one hand, our contrastive emotion decoupling achieves decoupling learning via a contrastive decoupling loss to strengthen the separability of emotion-relevant features from corpus-specific ones. On the other hand, our dual-level emotion alignment introduces an adaptive threshold pseudo-labeling to select confident target samples for class-level alignment, and performs corpus-level alignment to jointly guide model for learning class-discriminative corpus-invariant features across corpora. Extensive experimental results demonstrate the superior performance of EMO-DNA over the state-of-the-art methods in several cross-corpus scenarios. Source code is available at https://github.com/Jiaxin-Ye/Emo-DNA.
arXiv.org Artificial Intelligence
Aug-4-2023
- Country:
- Oceania > Australia
- New South Wales > Sydney (0.04)
- North America
- United States
- Rhode Island (0.04)
- Nevada (0.04)
- Washington > King County
- Seattle (0.04)
- New York > New York County
- New York City (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- California > Los Angeles County
- Long Beach (0.14)
- Canada
- Quebec > Montreal (0.04)
- Ontario > National Capital Region
- Ottawa (0.05)
- Alberta > Census Division No. 6
- Calgary Metropolitan Region > Calgary (0.04)
- United States
- Europe
- Asia
- Singapore (0.04)
- Middle East > Jordan (0.04)
- Macao (0.04)
- China
- Heilongjiang Province > Harbin (0.04)
- Fujian Province > Xiamen (0.04)
- Africa > Ethiopia
- Addis Ababa > Addis Ababa (0.04)
- Oceania > Australia
- Genre:
- Research Report (1.00)
- Technology:
- Information Technology > Artificial Intelligence
- Cognitive Science > Emotion (0.72)
- Speech (0.68)
- Natural Language (0.68)
- Machine Learning > Neural Networks
- Deep Learning (0.93)
- Information Technology > Artificial Intelligence