Model Composition for Multimodal Large Language Models