Tempo vs. Pitch: understanding self-supervised tempo estimation