COSA: Concatenated Sample Pretrained Vision-Language Foundation Model

Open in new window