The Stanford Question Answering Dataset
Stanford Question Answering Dataset (SQuAD) is a new reading comprehension dataset, consisting of questions posed by crowdworkers on a set of Wikipedia articles, where the answer to every question is a segment of text, or span, from the corresponding reading passage. With 100,000 question-answer pairs on 500 articles, SQuAD is significantly larger than previous reading comprehension datasets. We've built a few resources to help you get started with the dataset. Download a copy of the dataset (distributed under the CC BY-SA 4.0 license): To evaluate your models, we have also made available the evaluation script we will use for official evaluation, along with a sample prediction file that the script will take as input. To run the evaluation, use python evaluate-v1.1.py
Nov-4-2016, 14:50:06 GMT
- Technology: