Module-wise Adaptive Distillation for Multimodality Foundation Models

Neural Information Processing Systems 

Pre-trained multimodal foundation models have demonstrated remarkable generalizability but pose challenges for deployment due to their large sizes.
