To Align or Not to Align: Strategic Multimodal Representation Alignment for Optimal Performance