InterFeat: A Pipeline for Finding Interesting Scientific Features
Ofer, Dan, Linial, Michal, Shahaf, Dafna
–arXiv.org Artificial Intelligence
Finding interesting phenomena is the core of scientific discovery, but it is a manual, ill-defined concept. We present an integrative pipeline for automating the discovery of interesting simple hypotheses (feature-target relations with effect direction and a potential underlying mechanism) in structured biomedical data. The pipeline combines machine learning, knowledge graphs, literature search and Large Language Models. We formalize "interestingness" as a combination of novelty, utility and plausibility. On 8 major diseases from the UK Biobank, our pipeline consistently recovers risk factors years before their appearance in the literature. 40--53% of our top candidates were validated as interesting, compared to 0--7% for a SHAP-based baseline. Overall, 28% of 109 candidates were interesting to medical experts. The pipeline addresses the challenge of operationalizing "interestingness" scalably and for any target. We release data and code: https://github.com/LinialLab/InterFeat
arXiv.org Artificial Intelligence
Sep-9-2025
- Country:
- Africa > Zambia
- Southern Province > Choma (0.04)
- Asia
- Japan > Honshū
- Chūbu > Toyama Prefecture > Toyama (0.04)
- Middle East > Israel
- Jerusalem District > Jerusalem (0.04)
- Thailand > Bangkok
- Bangkok (0.04)
- Japan > Honshū
- Europe
- Italy > Tuscany
- Florence (0.04)
- Ukraine > Kyiv Oblast
- Kyiv (0.04)
- United Kingdom > Scotland (0.04)
- Italy > Tuscany
- North America > United States (0.04)
- Oceania
- Australia (0.04)
- New Zealand > North Island
- Waikato (0.04)
- Africa > Zambia
- Genre:
- Research Report > Experimental Study (0.93)
- Industry:
- Education > Health & Safety
- School Nutrition (0.68)
- Health & Medicine
- Consumer Health (1.00)
- Pharmaceuticals & Biotechnology (1.00)
- Therapeutic Area
- Cardiology/Vascular Diseases (1.00)
- Endocrinology (1.00)
- Gastroenterology (1.00)
- Neurology (0.92)
- Oncology (1.00)
- Education > Health & Safety
- Technology: