PyramidCLIP: HierarchicalFeatureAlignmentfor Vision-languageModelPretraining AnonymousAuthor(s) Affiliation Address email

Neural Information Processing Systems 

Zhuang, K. Li, H. Cheng, X. Guo, F. Huang, R. Ji, and X. Sun, "Disco: Remedy213 self-supervised learning on lightweight models with distilled contrastive learning,"arXiv preprint214 arXiv:2104.09124,2021.215