Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament