Leveraging Label Information for Multimodal Emotion Recognition