CentralNet: a Multilayer Approach for Multimodal Fusion