Enhancing Speech Emotion Recognition through Segmental Average Pooling of Self-Supervised Learning Features