Figure axis label: the range of x for sampling on the leftmost node

Neural Information Processing Systems 

Table 1: Average number of samples to reach the reward threshold on Mujoco-V1.

Rebuttal-Figure 1: LA-MCTS on Walker2d.

We sincerely thank reviewers R1, R2, and R3 for their constructive feedback.

Table 2 in the main paper uses Mujoco-V2. We redo the experiment on Mujoco-V1 in Table 1. LA-MCTS shows

This is when a plateau of regret happens. We will clarify it in the paper.