A Survey on Machine Reading Comprehension: Tasks, Evaluation Metrics, and Benchmark Datasets