Heterogeneous bimodal attention fusion for speech emotion recognition
