Contrastive Language-Image Pretrained Models are Zero-Shot Human Scanpath Predictors