Distilling Privileged Multimodal Information for Expression Recognition using Optimal Transport