Mozualization: Crafting Music and Visual Representation with Multimodal AI