Semi-supervised Clustering Through Representation Learning of Large-scale EHR Data