Minigrid & Miniworld: Modular & Customizable Reinforcement Learning Environments for Goal-Oriented Tasks Supplementary Materials