Detecting Emotion Carriers by Combining Acoustic and Lexical Representations