CLARA: Multilingual Contrastive Learning for Audio Representation Acquisition