Supplementary Material MICo Improved representations via sampling based state similarity for Markov decision processes A Extended background material