Policy Learning for Social Robot-Led Physiotherapy