Early Classifying Multimodal Sequences