Judging LLMs on a Simplex
