Low-hallucination Synthetic Captions for Large-Scale Vision-Language Model Pre-training

Open in new window