Decentralized Arena: Towards Democratic and Scalable Automatic Evaluation of Language Models

Open in new window