Contributions to the Improvement of Question Answering Systems in the Biomedical Domain
–arXiv.org Artificial Intelligence
This thesis work falls within the framework of question answering (QA) in the biomedical domain where several specific challenges are addressed, such as specialized lexicons and terminologies, the types of treated questions, and the characteristics of targeted documents. We are particularly interested in studying and improving methods that aim at finding accurate and short answers to biomedical natural language questions from a large scale of biomedical textual documents in English. QA aims at providing inquirers with direct, short and precise answers to their natural language questions. In this Ph.D. thesis, we propose four contributions to improve the performance of QA in the biomedical domain. In our first contribution, we propose a machine learning-based method for question type classification to determine the types of given questions which enable to a biomedical QA system to use the appropriate answer extraction method. We also propose an another machine learning-based method to assign one or more topics (e.g., pharmacological, test, treatment, etc.) to given questions in order to determine the semantic types of the expected answers which are very useful in generating specific answer retrieval strategies. In the second contribution, we first propose a document retrieval method to retrieve a set of relevant documents that are likely to contain the answers to biomedical questions from the MEDLINE database. We then present a passage retrieval method to retrieve a set of relevant passages to questions. In the third contribution, we propose specific answer extraction methods to generate both exact and ideal answers. Finally, in the fourth contribution, we develop a fully automated semantic biomedical QA system called SemBioNLQA which is able to deal with a variety of natural language questions and to generate appropriate answers by providing both exact and ideal answers.
arXiv.org Artificial Intelligence
Jul-25-2023
- Country:
- North America
- United States
- New York > New York County
- New York City (0.04)
- Maryland
- Montgomery County > Gaithersburg (0.04)
- Baltimore County (0.04)
- Baltimore (0.04)
- California
- Santa Clara County > Palo Alto (0.04)
- San Francisco County > San Francisco (0.04)
- New York > New York County
- Canada > British Columbia
- United States
- Europe
- United Kingdom > England
- South Yorkshire > Sheffield (0.04)
- Spain > Canary Islands
- Tenerife (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- Netherlands > South Holland
- Delft (0.04)
- Italy > Liguria
- Genoa (0.04)
- France
- Occitanie > Haute-Garonne
- Toulouse (0.04)
- Bourgogne-Franche-Comté > Doubs
- Besançon (0.04)
- Occitanie > Haute-Garonne
- Bulgaria > Sofia City Province
- Sofia (0.04)
- United Kingdom > England
- Africa > Middle East
- Morocco
- Rabat-Salé-Kénitra Region > Rabat (0.04)
- Fès-Meknès Region > Fez (0.04)
- Casablanca-Settat Region > Casablanca (0.04)
- Morocco
- North America
- Genre:
- Workflow (1.00)
- Research Report > New Finding (1.00)
- Overview (1.00)
- Industry:
- Education (0.92)
- Health & Medicine
- Pharmaceuticals & Biotechnology (1.00)
- Health Care Providers & Services (1.00)
- Consumer Health (0.92)
- Health Care Technology > Medical Record (0.67)
- Therapeutic Area
- Oncology (1.00)
- Neurology (1.00)
- Infections and Infectious Diseases (1.00)
- Genetic Disease (1.00)
- Hematology (0.67)
- Technology:
- Information Technology > Artificial Intelligence
- Representation & Reasoning
- Ontologies (1.00)
- Expert Systems (0.92)
- Natural Language
- Text Processing (1.00)
- Text Classification (1.00)
- Question Answering (1.00)
- Information Retrieval (1.00)
- Grammars & Parsing (1.00)
- Information Extraction (0.93)
- Machine Learning
- Statistical Learning (1.00)
- Neural Networks > Deep Learning (1.00)
- Performance Analysis > Accuracy (0.68)
- Learning Graphical Models > Directed Networks
- Bayesian Learning (0.93)
- Representation & Reasoning
- Information Technology > Artificial Intelligence