Diff-eRank: A Novel Rank-Based Metric for Evaluating Large Language Models