Appendix Effective Population Based Reinforcement Learning 8 Additional Experiment Details