Scaling Multimodal Pre-Training via Cross-Modality Gradient Harmonization Suppmentary Material 1 Qualitative examples of our proposed measure based on agreement of gradients

Neural Information Processing Systems 

Howto100M, hence challenging the CMA assumption commonly used in multimodal pre-training[ ?

Similar Docs  Excel Report  more

TitleSimilaritySource
None found