HeLo: Heterogeneous Multi-Modal Fusion with Label Correlation for Emotion Distribution Learning