Argument-Based Comparative Question Answering Evaluation Benchmark