Estimating the Error of Large Language Models at Pairwise Text Comparison
