Towards the Scalable Evaluation of Cooperativeness in Language Models
Chan, Alan, Riché, Maxime, Clifton, Jesse
–arXiv.org Artificial Intelligence
It is likely that AI systems driven by pre-trained language models (PLMs) will increasingly be used to assist humans in high-stakes interactions with other agents, such as negotiation or conflict resolution. Consistent with the goals of Cooperative AI \citep{dafoe_open_2020}, we wish to understand and shape the multi-agent behaviors of PLMs in a pro-social manner. An important first step is the evaluation of model behaviour across diverse cooperation problems. Since desired behaviour in an interaction depends upon precise game-theoretic structure, we focus on generating scenarios with particular structures with both crowdworkers and a language model. Our work proceeds as follows. First, we discuss key methodological issues in the generation of scenarios corresponding to particular game-theoretic structures. Second, we employ both crowdworkers and a language model to generate such scenarios. We find that the quality of generations tends to be mediocre in both cases. We additionally get both crowdworkers and a language model to judge whether given scenarios align with their intended game-theoretic structure, finding mixed results depending on the game. Third, we provide a dataset of scenario based on our data generated. We provide both quantitative and qualitative evaluations of UnifiedQA and GPT-3 on this dataset. We find that instruct-tuned models tend to act in a way that could be perceived as cooperative when scaled up, while other models seemed to have flat scaling trends.
arXiv.org Artificial Intelligence
Mar-16-2023
- Country:
- Africa
- Kenya (0.04)
- South Africa (0.04)
- Asia
- Europe
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Netherlands (0.04)
- United Kingdom
- England
- Cambridgeshire > Cambridge (0.14)
- Oxfordshire > Oxford (0.04)
- Scotland (0.04)
- England
- Ireland > Leinster
- North America
- Canada > Quebec
- Montreal (0.04)
- Dominican Republic (0.04)
- United States
- Illinois > Cook County
- Chicago (0.04)
- New York > New York County
- New York City (0.04)
- Illinois > Cook County
- Canada > Quebec
- Africa
- Genre:
- Research Report > New Finding (0.67)
- Industry:
- Government (1.00)
- Law (1.00)
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
- Leisure & Entertainment > Games (0.93)
- Technology: