Potential-based Reward Shaping in Sokoban