An Examination of the Robustness of Reference-Free Image Captioning Evaluation Metrics