Learning to Generate Novel Scientific Directions with Contextualized Literature-based Discovery
Wang, Qingyun, Downey, Doug, Ji, Heng, Hope, Tom
–arXiv.org Artificial Intelligence
Literature-Based Discovery (LBD) aims to discover new scientific knowledge by mining papers and generating hypotheses. Standard LBD is limited to predicting pairwise relations between discrete concepts (e.g., drug-disease links), and ignores critical contexts like experimental settings (e.g., a specific patient population where a drug is evaluated) and background motivations (e.g., to find drugs without specific side effects). We address these limitations with a novel formulation of contextualized-LBD (C-LBD): generating scientific hypotheses in natural language, while grounding them in a context that controls the hypothesis search space. We present a modeling framework using retrieval of ``inspirations'' from past scientific papers. Our evaluations reveal that GPT-4 tends to generate ideas with overall low technical depth and novelty, while our inspiration prompting approaches partially mitigate this issue. Our work represents a first step toward building language models that generate new ideas derived from scientific literature.
arXiv.org Artificial Intelligence
Jan-22-2024
- Country:
- Europe (1.00)
- North America > United States
- Minnesota > Hennepin County > Minneapolis (0.14)
- Genre:
- Research Report
- Experimental Study (0.67)
- New Finding (0.46)
- Promising Solution (0.67)
- Research Report
- Industry:
- Education (0.93)
- Health & Medicine (1.00)
- Technology: