Grounded ReinforcementLearning: LearningtoWintheGameunderHumanCommands SupplementaryMaterials