From Concrete to Abstract: A Multimodal Generative Approach to Abstract Concept Learning