Similarity encoding for learning with dirty categorical variables
