CgT-GAN: CLIP-guided Text GAN for Image Captioning