Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena

Jan-19-2025, 15:30:04 GMT–Neural Information Processing Systems

[no summary]

agreement, human preference, llm-as-a-judge, (3 more...)

Neural Information Processing Systems

Jan-19-2025, 15:30:04 GMT

Conferences Web Page

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.99)