Fusing ASR Outputs in Joint Training for Speech Emotion Recognition
