AutoBench: Automating LLM Evaluation through Reciprocal Peer Assessment

Open in new window