Clustering for Protein Representation Learning