CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval