Clembench: Using Game Play to Evaluate Chat-Optimized Language Models as Conversational Agents