Unsupervised Control Through Non-Parametric Discriminative Rewards