Werewolf Arena: A Case Study in LLM Evaluation via Social Deduction

Open in new window