A Comprehensive Evaluation on Event Reasoning of Large Language Models