Large Language Models are not Fair Evaluators