A Concept-Centric Approach to Multi-Modality Learning