Extracting Speaker-Specific Information with a Regularized Siamese Deep Network