Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
