A Comprehensive Evaluation of Large Language Models on Temporal Event Forecasting