Frame-level emotional state alignment method for speech emotion recognition