Contrastive Learning for Image Captioning