Self-supervised reinforcement learning for speaker localisation with the iCub humanoid robot