GameEval: Evaluating LLMs on Conversational Games