Randomized Smoothing Meets Vision-Language Models