ICLEval: Evaluating In-Context Learning Ability of Large Language Models