Enabling AI Scientists to Recognize Innovation: A Domain-Agnostic Algorithm for Assessing Novelty
Wang, Yao, Cui, Mingxuan, Jiang, Arthur
–arXiv.org Artificial Intelligence
In the pursuit of Artificial General Intelligence (AGI), automating the generation and evaluation of novel research ideas is a key challenge in AI-driven scientific discovery. This paper presents Relative Neighbor Density (RND), a domain-agnostic algorithm for novelty assessment in research ideas that overcomes the limitations of existing approaches by comparing an idea's local density with its adjacent neighbors' densities. We first developed a scalable methodology to create test set without expert labeling, addressing a fundamental challenge in novelty assessment. Using these test sets, we demonstrate that our RND algorithm achieves state-of-the-art (SOTA) performance in computer science (AUROC=0.820) and biomedical research (AUROC=0.765) domains. Most significantly, while SOTA models like Sonnet-3.7 and existing metrics show domain-specific performance degradation, RND maintains consistent accuracies across domains by its domain-invariant property, outperforming all benchmarks by a substantial margin (0.795 v.s. 0.597) on cross-domain evaluation. These results validate RND as a generalizable solution for automated novelty assessment in scientific research.
arXiv.org Artificial Intelligence
Mar-10-2025
- Country:
- North America > United States (0.04)
- Europe > Italy (0.04)
- Africa > Sub-Saharan Africa (0.04)
- Asia > China
- Hubei Province > Wuhan (0.04)
- Genre:
- Research Report
- Strength High (1.00)
- New Finding (1.00)
- Experimental Study (1.00)
- Promising Solution (0.68)
- Research Report
- Industry:
- Health & Medicine
- Pharmaceuticals & Biotechnology (1.00)
- Diagnostic Medicine (1.00)
- Epidemiology (0.93)
- Therapeutic Area
- Infections and Infectious Diseases (1.00)
- Oncology (1.00)
- Hematology (1.00)
- Cardiology/Vascular Diseases (1.00)
- Immunology (1.00)
- Neurology (1.00)
- Pulmonary/Respiratory Diseases (1.00)
- Endocrinology (0.68)
- Obstetrics/Gynecology (0.67)
- Health & Medicine
- Technology:
- Information Technology
- Data Science > Data Mining (1.00)
- Artificial Intelligence
- Vision (1.00)
- Representation & Reasoning (1.00)
- Cognitive Science (1.00)
- Natural Language
- Large Language Model (1.00)
- Chatbot (0.93)
- Machine Learning
- Statistical Learning (1.00)
- Neural Networks > Deep Learning (1.00)
- Information Technology