Controlling for Confounders in Multimodal Emotion Classification via Adversarial Learning