Potential-based Reward Shaping in Sokoban
