Split and Merge: Aligning Position Biases in Large Language Model based Evaluators

Open in new window