MultiModal-Learning for Predicting Molecular Properties: A Framework Based on Image and Graph Structures