Dropping Just a Handful of Preferences Can Change Top Large Language Model Rankings
