Supplementary Information: Meta-ReinforcementLearningwith Self-ModifyingNetworks 9 Optimization