Supplementary Materials

A Experiment

As suggested by one reviewer, we conduct the following experiment on CartPole in OpenAI Gym to