The Multi-Agent Reinforcement Learning in MalmÖ (MARLÖ) Competition