Benchmarking Zero-Shot Robustness of Multimodal Foundation Models: A Pilot Study