AITopics | Scientific Discovery

Collaborating Authors

Scientific Discovery

"The problem of giving rules for producing true scientific statements has been replaced by the problem of finding efficient heuristic rules for culling the reasonable candidates for an explanation from an appropriate set of possible candidates [and finding methods for constructing the candidates]."
– B. Buchanan, quoted in Lindley Darden. Recent Work in Computational Scientific Discovery.

News Overviews Instructional Materials AI-Alerts Classics

Abductive Inference in Retrieval-Augmented Language Models: Generating and Validating Missing Premises

Lin, Shiyin

arXiv.org Artificial IntelligenceNov-7-2025

Large Language Models (LLMs) enhanced with retrieval -- commonly referred to as Retrieval-Augmented Generation (RAG) -- have demonstrated strong performance in knowledge-intensive tasks. However, RAG pipelines often fail when retrieved evidence is incomplete, leaving gaps in the reasoning process. In such cases, \emph{abductive inference} -- the process of generating plausible missing premises to explain observations -- offers a principled approach to bridge these gaps. In this paper, we propose a framework that integrates abductive inference into retrieval-augmented LLMs. Our method detects insufficient evidence, generates candidate missing premises, and validates them through consistency and plausibility checks. Experimental results on abductive reasoning and multi-hop QA benchmarks show that our approach improves both answer accuracy and reasoning faithfulness. This work highlights abductive inference as a promising direction for enhancing the robustness and explainability of RAG systems.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2511.0402

Country: North America > United States (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Scientific Discovery (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Abductive Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.51)

Add feedback

MOOSE-Chem3: Toward Experiment-Guided Hypothesis Ranking via Simulated Experimental Feedback

Liu, Wanhao, Yang, Zonglin, Wang, Jue, Bing, Lidong, Zhang, Di, Zhou, Dongzhan, Li, Yuqiang, Li, Houqiang, Cambria, Erik, Ouyang, Wanli

arXiv.org Artificial IntelligenceOct-28-2025

Hypothesis ranking is vital for automated scientific discovery, especially in cost-intensive, throughput-limited natural science domains. Current methods focus on pre-experiment ranking, relying solely on language model reasoning without empirical feedback. We introduce experiment-guided ranking, which prioritizes hypotheses based on feedback from prior tests. Due to the impracticality of real experiments, we propose a simulator grounded in domain-specific concepts that models hypothesis performance as a function of similarity to a hidden ground truth, perturbed by noise. Validated against 124 hypotheses with experimentally reported outcomes, the simulator approximates real results with consistent trend alignment. Although deviations exist, they mimic wet-lab noise, promoting more robust ranking strategies. We frame experiment-guided ranking as a sequential decision-making problem and propose an in-context reinforcement learning (ICRL) framework. Our LLM-based policy decomposes hypotheses into functional elements, clusters them by mechanistic roles, and prioritizes recombinations based on feedback. Experiments show our approach significantly outperforms pre-experiment baselines and strong ablations. Our toolkit, comprising the simulator and ICRL framework, enables systematic research on experiment-guided ranking, with the policy serving as a strong proof of concept.

large language model, machine learning, reinforcement learning, (22 more...)

arXiv.org Artificial Intelligence

2505.17873

Country:

Asia > Middle East > Jordan (0.04)
Asia > China > Shanghai > Shanghai (0.04)

Genre: Research Report > New Finding (1.00)

Industry:

Energy (1.00)
Health & Medicine (0.93)
Materials > Chemicals > Commodity Chemicals (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.54)
Information Technology > Artificial Intelligence > Representation & Reasoning > Scientific Discovery (0.48)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.46)

Add feedback

AutoSciDACT: Automated Scientific Discovery through Contrastive Embedding and Hypothesis Testing

Bright-Thonney, Samuel, Reissel, Christina, Grosso, Gaia, Woodward, Nathaniel, Govorkova, Katya, Novak, Andrzej, Park, Sang Eon, Moreno, Eric, Harris, Philip

arXiv.org Machine LearningOct-28-2025

Novelty detection in large scientific datasets faces two key challenges: the noisy and high-dimensional nature of experimental data, and the necessity of making statistically robust statements about any observed outliers. While there is a wealth of literature on anomaly detection via dimensionality reduction, most methods do not produce outputs compatible with quantifiable claims of scientific discovery. In this work we directly address these challenges, presenting the first step towards a unified pipeline for novelty detection adapted for the rigorous statistical demands of science. We introduce AutoSciDACT (Automated Scientific Discovery with Anomalous Contrastive Testing), a general-purpose pipeline for detecting novelty in scientific data. AutoSciDACT begins by creating expressive low-dimensional data representations using a contrastive pre-training, leveraging the abundance of high-quality simulated data in many scientific domains alongside expertise that can guide principled data augmentation strategies. These compact embeddings then enable an extremely sensitive machine learning-based two-sample test using the New Physics Learning Machine (NPLM) framework, which identifies and statistically quantifies deviations in observed data relative to a reference distribution (null hypothesis). We perform experiments across a range of astronomical, physical, biological, image, and synthetic datasets, demonstrating strong sensitivity to small injections of anomalous data across all domains.

artificial intelligence, data mining, machine learning, (17 more...)

arXiv.org Machine Learning

2510.21935

Country:

Oceania > New Zealand (0.04)
Oceania > Cook Islands (0.04)
North America > United States > Wisconsin > Dane County > Madison (0.04)
(5 more...)

Genre: Research Report > New Finding (0.67)

Industry:

Health & Medicine (1.00)
Education > Curriculum > Subject-Specific Education (0.34)

Technology:

Information Technology > Data Science > Data Mining > Anomaly Detection (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Scientific Discovery (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

Add feedback

A New Paradigm for Protecting Homes from Disastrous Fires

The New YorkerOct-22-2025, 10:00:00 GMT

Scientists have identified more than fifty ways that houses can ignite. It's possible to defend against all of them--but it's arduous, and homeowners can't do it alone. In June, 2012, hundreds of homes in Mountain Shadows, Colorado, a subdivision in the foothills of the Rockies, were reduced to ash during the wind-whipped Waldo Canyon Fire. On a cul-de-sac called Hot Springs Court, however, four dwellings somehow remained standing. The mystery of their survival nagged at Alex Maranghides, a fire-protection engineer at the National Institute of Standards and Technology (), who worked with several colleagues on a meticulous reconstruction of the fire. How did the homes make it through? Was there something special about them--a fireproof roof, say, or a fancy sprinkler system? The team collected weather reports, topographic data, G.P.S. records from fire engines, photos, videos, and property-damage reports.

california, maranghide, neighbor, (16 more...)

The New Yorker

Country:

North America > United States > Colorado (0.24)
North America > United States > New York (0.05)
South America (0.04)
(8 more...)

Industry:

Law Enforcement & Public Safety > Fire & Emergency Services (1.00)
Law (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Banking & Finance (0.94)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Scientific Discovery (0.41)
Information Technology > Artificial Intelligence > Cognitive Science > Creativity & Intelligence (0.41)

Add feedback

From AI for Science to Agentic Science: A Survey on Autonomous Scientific Discovery

Wei, Jiaqi, Yang, Yuejin, Zhang, Xiang, Chen, Yuhan, Zhuang, Xiang, Gao, Zhangyang, Zhou, Dongzhan, Wang, Guangshuai, Gao, Zhiqiang, Cao, Juntai, Qiu, Zijie, Hu, Ming, Ma, Chenglong, Tang, Shixiang, He, Junjun, Song, Chunfeng, He, Xuming, Zhang, Qiang, You, Chenyu, Zheng, Shuangjia, Ding, Ning, Ouyang, Wanli, Dong, Nanqing, Cheng, Yu, Sun, Siqi, Bai, Lei, Zhou, Bowen

arXiv.org Artificial IntelligenceOct-21-2025

Artificial intelligence (AI) is reshaping scientific discovery, evolving from specialized computational tools into autonomous research partners. We position Agentic Science as a pivotal stage within the broader AI for Science paradigm, where AI systems progress from partial assistance to full scientific agency. Enabled by large language models (LLMs), multimodal systems, and integrated research platforms, agentic AI shows capabilities in hypothesis generation, experimental design, execution, analysis, and iterative refinement -- behaviors once regarded as uniquely human. This survey provides a domain-oriented review of autonomous scientific discovery across life sciences, chemistry, materials science, and physics. We unify three previously fragmented perspectives -- process-oriented, autonomy-oriented, and mechanism-oriented -- through a comprehensive framework that connects foundational capabilities, core processes, and domain-specific realizations. Building on this framework, we (i) trace the evolution of AI for Science, (ii) identify five core capabilities underpinning scientific agency, (iii) model discovery as a dynamic four-stage workflow, (iv) review applications across the above domains, and (v) synthesize key challenges and future opportunities. This work establishes a domain-oriented synthesis of autonomous scientific discovery and positions Agentic Science as a structured paradigm for advancing AI-driven research.

large language model, machine learning, purpose representative mechanism key reference, (22 more...)

arXiv.org Artificial Intelligence

2508.14111

Country:

Europe > Austria > Vienna (0.14)
North America > United States > Florida > Miami-Dade County > Miami (0.14)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
(11 more...)

Genre:

Workflow (1.00)
Research Report > New Finding (1.00)
Overview (1.00)

Industry:

Materials > Chemicals (1.00)
Information Technology (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
(4 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Scientific Discovery (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(7 more...)

Add feedback

Foundation Models for Scientific Discovery: From Paradigm Enhancement to Paradigm Transition

Liu, Fan, Han, Jindong, Lyu, Tengfei, Zhang, Weijia, Yang, Zhe-Rui, Dai, Lu, Liu, Cancheng, Liu, Hao

arXiv.org Artificial IntelligenceOct-20-2025

Foundation models (FMs), such as GPT-4 and AlphaFold, are reshaping the landscape of scientific research. Beyond accelerating tasks such as hypothesis generation, experimental design, and result interpretation, they prompt a more fundamental question: Are FMs merely enhancing existing scientific methodologies, or are they redefining the way science is conducted? In this paper, we argue that FMs are catalyzing a transition toward a new scientific paradigm. We introduce a three-stage framework to describe this evolution: (1) Meta-Scientific Integration, where FMs enhance workflows within traditional paradigms; (2) Hybrid Human-AI Co-Creation, where FMs become active collaborators in problem formulation, reasoning, and discovery; and (3) Autonomous Scientific Discovery, where FMs operate as independent agents capable of generating new scientific knowledge with minimal human intervention. Through this lens, we review current applications and emerging capabilities of FMs across existing scientific paradigms. We further identify risks and future directions for FM-enabled scientific discovery. This position paper aims to support the scientific community in understanding the transformative role of FMs and to foster reflection on the future of scientific discovery. Our project is available at https://github.com/usail-hkust/Awesome-Foundation-Models-for-Scientific-Discovery.

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2510.1528

Country:

Asia > China > Hong Kong (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
Asia > Middle East > Jordan (0.04)
(5 more...)

Genre:

Research Report (1.00)
Overview (0.87)

Industry:

Health & Medicine > Therapeutic Area (0.67)
Health & Medicine > Pharmaceuticals & Biotechnology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Scientific Discovery (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

The scientific discoveries that prove God does exist, according to best-selling French book based on insights from 62 Nobel Prize winners

Daily Mail - Science & techOct-19-2025, 06:56:46 GMT

The watershed moment Trump changed course on Israel after Netanyahu shattered their once-unbreakable bond: 'We felt betrayed' Kim Kardashian stuns onlookers in horrifying MASKED look at one of Hollywood's biggest galas DAPHNE BARAK: How I delivered the final, fatal blow to Andrew's fast-sinking reputation... and why Palace is right to still be deeply concerned Doctors expose the truth about melatonin... as terrifying side effects soar Gavin Newsom melts down as Pentagon plans to fire artillery shells over California highway during'No Kings' protest Olivia Nuzzi's memoir will reveal juicy text messages with RFK Jr. KENNEDY: Here's the truth of weird drug-fueled orgies in Congress that Tucker Carlson is investigating... it makes me sick to my stomach JANA HOCKING: I've uncovered the ultimate new sex secret and had the best night of my life... no wonder more women are trying it Limp Bizkit bassist Sam Rivers dead at 48 as iconic band pays tribute to'once-in-a-lifetime' talent Insiders reveal dark web of power behind earthquake of'No Kings' protests exploding across America Five safe haven investments if the global economy goes into meltdown (and one under the radar fund to buy RIGHT NOW): As more and more experts warn of a devastating fall in share prices... Inside the King's cold phone call that saw Prince Andrew lose his dukedom and have to cancel Sarah Ferguson's 66th birthday party as Epstein scandal exploded '90s icon looks unrecognizable as she teases her most infamous TV scene in bucket hat during rare outing Antonio Banderas and Melanie Griffith's daughter Stella, 29, weds her childhood sweetheart in dreamy Spanish wedding as actor toasts the newlyweds Stephen A. Smith makes racially-charged double standard accusation against LeBron James amid feud The Duchess of Scandal... who is now plain old Sarah: Fergie's humiliating downfall as King makes moves to'protect' her daughters Green Bay Packers' game in jeopardy with team stranded at airport less than 24 hours before kickoff Selena Gomez makes FIRST red carpet appearance with husband Benny Blanco since wedding as their'perfect' honeymoon is revealed READ MORE: Is there a God? It's a question that has been asked since the beginning of time: does God really exist? Traditionally, science has been the counterargument for the existence of a divine creator. However, French mathematicians Olivier Bonnassies and Michel-Yves Bollore now say that science'has become God's ally'. In a new book, the duo have distilled insights from 62 Nobel Prize winners and more than 100 leading scientists to pinpoint the scientific discoveries that could prove God is real.

nobel prize winner, scientific discovery, taylor swift, (12 more...)

Daily Mail - Science & tech

Country:

Asia > Middle East > Israel (0.25)
North America > Canada > Alberta (0.14)
North America > United States > Wyoming (0.04)
(15 more...)

Genre: Personal > Honors > Award (0.34)

Industry:

Media > Television (1.00)
Media > Music (1.00)
Media > Film (1.00)
(8 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Communications > Mobile (0.88)
Information Technology > Artificial Intelligence > Representation & Reasoning > Scientific Discovery (0.61)

Add feedback

Rise of the Robochemist

Zhu, Jihong, Huang, Kefeng, Pipe, Jonathon, Horbaczewsky, Chris, Tyrrell, Andy, Fairlamb, Ian J. S.

arXiv.org Artificial IntelligenceOct-14-2025

Abstract--Chemistry, a long-standing discipline, has historically relied on manual and often time-consuming processes. While some automation exists, the field is now on the cusp of a significant evolution driven by the integration of robotics and artificial intelligence (AI), giving rise to the concept of the robochemist: a new paradigm where autonomous systems assist in designing, executing, and analyzing experiments. Robo-chemists integrate mobile manipulators, advanced perception, teleoperation, and data-driven protocols to execute experiments with greater adaptability, reproducibility, and safety. Rather than a fully automated replacement for human chemists, we envisioned the robochemist as a complementary partner that works collaboratively to enhance discovery, enabling a more efficient exploration of chemical space and accelerating innovation in pharmaceuticals, materials science, and sustainable manufacturing. This article traces the technologies, applications, and challenges that define this transformation, highlighting both the opportunities and the responsibilities that accompany the emergence of the robochemist. Ultimately, the future of chemistry is argued to lie in a symbiotic partnership where human intuition and expertise is amplified by robotic precision and AI-driven insight. The field of chemistry, a cornerstone of modern science and industry, has long been characterized by a blend of theoretical insight and practical, hands-on experimentation.

artificial intelligence, creativity & intelligence, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2510.10337

Genre: Research Report (1.00)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (0.48)
Materials > Chemicals (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Scientific Discovery (0.48)
(2 more...)

Add feedback

Spec-Driven AI for Science: The ARIA Framework for Automated and Reproducible Data Analysis

Chen, Chuke, Luo, Biao, Li, Nan, Wang, Boxiang, Yang, Hang, Guo, Jing, Xu, Ming

arXiv.org Artificial IntelligenceOct-14-2025

The rapid expansion of scientific data has widened the gap between analytical capability and research intent. Existing AI-based analysis tools, ranging from AutoML frameworks to agentic research assistants, either favor automation over transparency or depend on manual scripting that hinders scalability and reproducibility. We present ARIA (Automated Research Intelligence Assistant), a spec-driven, human-in-the-loop framework for automated and interpretable data analysis. ARIA integrates six interoperable layers, namely Command, Context, Code, Data, Orchestration, and AI Module, within a document-centric workflow that unifies human reasoning and machine execution. Through natural-language specifications, researchers define analytical goals while ARIA autonomously generates executable code, validates computations, and produces transparent documentation. Beyond achieving high predictive accuracy, ARIA can rapidly identify optimal feature sets and select suitable models, minimizing redundant tuning and repetitive experimentation. In the Boston Housing case, ARIA discovered 25 key features and determined XGBoost as the best performing model (R square = 0.93) with minimal overfitting. Evaluations across heterogeneous domains demonstrate ARIA's strong performance, interpretability, and efficiency compared with state-of-the-art systems. By combining AI for research and AI for science principles within a spec-driven architecture, ARIA establishes a new paradigm for transparent, collaborative, and reproducible scientific discovery.

large language model, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2510.11143

Country: