ConViTac: Aligning Visual-Tactile Fusion with Contrastive Representations