Enabling AI Scientists to Recognize Innovation: A Domain-Agnostic Algorithm for Assessing Novelty
Wang, Yao, Cui, Mingxuan, Jiang, Arthur
–arXiv.org Artificial Intelligence
In the pursuit of Artificial General Intelligence (AGI), automating the generation and evaluation of novel research ideas is a key challenge in AI-driven scientific discovery. This paper presents Relative Neighbor Density (RND), a domain-agnostic algorithm for novelty assessment in research ideas that overcomes the limitations of existing approaches by comparing an idea's local density with its adjacent neighbors' densities. We first developed a scalable methodology to create test set without expert labeling, addressing a fundamental challenge in novelty assessment. Using these test sets, we demonstrate that our RND algorithm achieves state-of-the-art (SOTA) performance in computer science (AUROC=0.820) and biomedical research (AUROC=0.765) domains. Most significantly, while SOTA models like Sonnet-3.7 and existing metrics show domain-specific performance degradation, RND maintains consistent accuracies across domains by its domain-invariant property, outperforming all benchmarks by a substantial margin (0.795 v.s. 0.597) on cross-domain evaluation. These results validate RND as a generalizable solution for automated novelty assessment in scientific research.
arXiv.org Artificial Intelligence
Mar-10-2025
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (1.00)
- Promising Solution (0.68)
- Strength High (1.00)
- Research Report
- Industry:
- Health & Medicine
- Diagnostic Medicine (1.00)
- Epidemiology (0.93)
- Pharmaceuticals & Biotechnology (1.00)
- Therapeutic Area
- Endocrinology (0.68)
- Pulmonary/Respiratory Diseases (1.00)
- Neurology (1.00)
- Obstetrics/Gynecology (0.67)
- Immunology (1.00)
- Cardiology/Vascular Diseases (1.00)
- Hematology (1.00)
- Oncology (1.00)
- Infections and Infectious Diseases (1.00)
- Health & Medicine
- Technology:
- Information Technology
- Artificial Intelligence
- Cognitive Science (1.00)
- Machine Learning
- Neural Networks > Deep Learning (1.00)
- Statistical Learning (0.92)
- Natural Language
- Chatbot (0.93)
- Large Language Model (1.00)
- Representation & Reasoning (1.00)
- Vision (1.00)
- Data Science > Data Mining (1.00)
- Artificial Intelligence
- Information Technology