Evaluation of Question Answering Systems: Complexity of judging a natural language

Farea, Amer, Yang, Zhen, Duong, Kien, Perera, Nadeesha, Emmert-Streib, Frank

Sep-10-2022–arXiv.org Artificial Intelligence

Question answering (QA) systems are among the most important and rapidly developing research topics in natural language processing (NLP). One reason for this is that a QA system allows humans to interact more naturally with a machine, e.g., via a virtual assistant or search engine. In recent decades, many QA systems have been proposed to address the requirements of different question-answering tasks. Furthermore, many error scores have been introduced to measure the performance of a QA system, e.g., based on n-gram matching, word embeddings, or contextual embeddings. This survey attempts to provide a systematic overview of the general framework of QA, QA paradigms, benchmark datasets, and assessment techniques for a quantitative evaluation of QA systems. The latter is particularly important because not only the construction of a QA system but also its evaluation is complex. We hypothesize that one reason for this is that the quantitative formalization of human judgment is an open problem.

machine learning, question answering, springer nature 2021

Country:
- North America > United States (0.28)
- Europe
  - United Kingdom (0.04)
  - Russia (0.04)
  - Finland > Pirkanmaa
    - Tampere (0.04)
- Asia
  - Russia (0.04)
  - Middle East
    - UAE (0.05)
    - Iraq (0.04)

Genre:
- Overview (1.00)

Industry:
- Education (0.68)
- Media (0.45)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Question Answering (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)
