Learning Optimal Policy for Simultaneous Machine Translation via Binary Search