Werewolf Arena: A Case Study in LLM Evaluation via Social Deduction