Toward Multimodal Model-Agnostic Meta-Learning