EquiAV: Leveraging Equivariance for Audio-Visual Contrastive Learning