CHIMERA: A Knowledge Base of Scientific Idea Recombinations for Research Analysis and Ideation
–arXiv.org Artificial Intelligence
A hallmark of human innovation is recombination -- the creation of novel ideas by integrating elements from existing concepts and mechanisms. In this work, we introduce CHIMERA, a large-scale Knowledge Base (KB) of over 28K recombination examples automatically mined from the scientific literature. CHIMERA enables large-scale empirical analysis of how scientists recombine concepts and draw inspiration from different areas, and enables training models that propose novel, cross-disciplinary research directions. To construct this KB, we define a new information extraction task: identifying recombination instances in scientific abstracts. We curate a high-quality, expert-annotated dataset and use it to fine-tune a large language model, which we apply to a broad corpus of AI papers. We showcase the utility of CHIMERA through two applications. First, we analyze patterns of recombination across AI subfields. Second, we train a scientific hypothesis generation model using the KB, showing that it can propose novel research directions that researchers rate as inspiring. We release our data and code at https://github.com/noy-sternlicht/CHIMERA-KB.
arXiv.org Artificial Intelligence
Jul-30-2025
- Country:
- Asia
- Middle East > Israel
- Jerusalem District > Jerusalem (0.04)
- Myanmar > Tanintharyi Region
- Dawei (0.04)
- Middle East > Israel
- Europe > Austria
- Vienna (0.14)
- North America > United States
- District of Columbia > Washington (0.04)
- Asia
- Genre:
- Research Report > Promising Solution (0.48)
- Industry:
- Education (0.93)
- Health & Medicine
- Pharmaceuticals & Biotechnology (1.00)
- Therapeutic Area (1.00)
- Technology: