Split and Merge: Aligning Position Biases in Large Language Model based Evaluators