Clembench: Using Game Play to Evaluate Chat-Optimized Language Models as Conversational Agents

Open in new window