emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation