Fine-grained Image Captioning with CLIP Reward