MCRL4OR: Multimodal Contrastive Representation Learning for Off-Road Environmental Perception