Are Large Language Models Really Good Logical Reasoners? A Comprehensive Evaluation and Beyond