Understanding Retrieval Robustness for Retrieval-Augmented Image Captioning