Safer Deep RL with Shallow MCTS: A Case Study in Pommerman
