Large-scale learning of generalised representations for speaker recognition