LLMEval: A Preliminary Study on How to Evaluate Large Language Models