Grounded Reinforcement Learning: Learning to Win the Game under Human Commands
Supplementary Materials