MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language Models

Neural Information Processing Systems 

Multimodal large language models (MLLMs) have demonstrated impressive capabilities across various vision-language tasks.
