MICo: Learning improved representations via sampling-based state similarity for Markov decision processes