Continuous Rating as Reliable Human Evaluation of Simultaneous Speech Translation