version of our paper, we shall clarify the details in Section 3 (R2), and make intuition in the methods section much

Neural Information Processing Systems 

We thank the reviewers for their detailed comments, suggestions, and positive assessment of our work. We will correct the color schemes in all figures (R1). We have also made the figure captions cleaner (R3) and added a description of the setup to the paper. In Fig. 5 (left), DisCor actually outperforms Unif(s,a) on these environments.