MarkQA: A large scale KBQA dataset with numerical reasoning

Huang, Xiang, Cheng, Sitao, Bao, Yuheng, Huang, Shanshan, Qu, Yuzhong

Dec-13-2023, arXiv.org Artificial Intelligence

While question answering over knowledge bases (KBQA) has made progress on factoid questions, KBQA with numerical reasoning remains relatively unexplored. In this paper, we focus on complex numerical reasoning in KBQA and propose a new task, NR-KBQA, which requires both multi-hop reasoning and numerical reasoning. We design a logical form in Python format, called PyQL, to represent the reasoning process of numerical reasoning questions. To facilitate the development of NR-KBQA, we present a large-scale dataset called MarkQA, which is automatically constructed from a small set of seeds. Each question in MarkQA is paired with its corresponding SPARQL query, along with a step-by-step reasoning process in the QDMR format and as a PyQL program. Experimental results of several state-of-the-art QA methods on MarkQA show that complex numerical reasoning in KBQA remains highly challenging.
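To make the idea of a Python-format logical form more concrete, the following is a minimal sketch of a query builder that collects multi-hop triple patterns and one arithmetic step, then emits a SPARQL string. It is an illustration only and does not reproduce the actual PyQL API from the paper: the class `ToyQuery`, its methods, the `ex:` prefix, and every entity and property name are hypothetical.

```python
class ToyQuery:
    """Hypothetical builder for illustration; NOT the PyQL API from the paper."""

    def __init__(self):
        self.patterns = []  # triple patterns (the multi-hop part)
        self.binds = []     # arithmetic expressions (the numerical part)

    def add_triple(self, subj, pred, obj):
        # Record one graph pattern; chaining several gives multi-hop reasoning.
        self.patterns.append(f"{subj} {pred} {obj} .")
        return self

    def add_bind(self, expression, var):
        # Record a numerical step as a SPARQL BIND clause.
        self.binds.append(f"BIND(({expression}) AS {var})")
        return self

    def to_sparql(self, answer_var):
        # Assemble the patterns and BIND clauses into a SELECT query.
        body = "\n".join(f"  {line}" for line in self.patterns + self.binds)
        prefix = "PREFIX ex: <http://example.org/>"  # assumed placeholder namespace
        return f"{prefix}\nSELECT {answer_var} WHERE {{\n{body}\n}}"


# Assumed example question: "How much higher is the highest peak of the
# range containing mountain A than mountain A itself?"
query = (
    ToyQuery()
    .add_triple("ex:MountainA", "ex:locatedIn", "?range")  # hop 1
    .add_triple("?range", "ex:highestPeak", "?peak")       # hop 2
    .add_triple("?peak", "ex:elevation", "?peak_elev")
    .add_triple("ex:MountainA", "ex:elevation", "?a_elev")
    .add_bind("?peak_elev - ?a_elev", "?diff")             # numerical step
)
sparql = query.to_sparql("?diff")
print(sparql)
```

The sketch keeps graph traversal and arithmetic as separate steps, which mirrors the abstract's distinction between multi-hop reasoning and numerical reasoning; how PyQL itself organizes these steps is described in the paper.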

Keywords: numerical reasoning, PyQL, reasoning

Country:
- North America > United States
  - New York > New York County
    - New York City (0.04)
  - Minnesota > Hennepin County
    - Minneapolis (0.14)
  - Louisiana > Orleans Parish
    - New Orleans (0.04)
- Europe
  - Russia (0.04)
  - Ireland > Leinster
    - County Dublin > Dublin (0.04)
- Asia
  - Russia (0.04)
  - Japan (0.04)
  - Middle East
    - Qatar (0.04)
    - UAE > Abu Dhabi Emirate
      - Abu Dhabi (0.04)
  - China
    - Shanghai > Shanghai (0.04)
    - Jiangsu Province > Nanjing (0.04)
    - Hong Kong (0.04)
    - Beijing > Beijing (0.04)

Genre:
- Research Report (0.82)

Industry:
- Leisure & Entertainment > Games (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (1.00)
  - Cognitive Science > Problem Solving (1.00)
  - Natural Language > Question Answering (0.67)