AITopics | Scientific Discovery

Collaborating Authors

Scientific Discovery

"The problem of giving rules for producing true scientific statements has been replaced by the problem of finding efficient heuristic rules for culling the reasonable candidates for an explanation from an appropriate set of possible candidates [and finding methods for constructing the candidates]."
– B. Buchanan, quoted in Lindley Darden. Recent Work in Computational Scientific Discovery.

News Overviews Instructional Materials AI-Alerts Classics

From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery

Zheng, Tianshi, Deng, Zheye, Tsang, Hong Ting, Wang, Weiqi, Bai, Jiaxin, Wang, Zihao, Song, Yangqiu

arXiv.org Artificial IntelligenceSep-18-2025

Large Language Models (LLMs) are catalyzing a paradigm shift in scientific discovery, evolving from task-specific automation tools into increasingly autonomous agents and fundamentally redefining research processes and human-AI collaboration. This survey systematically charts this burgeoning field, placing a central focus on the changing roles and escalating capabilities of LLMs in science. Through the lens of the scientific method, we introduce a foundational three-level taxonomy-Tool, Analyst, and Scientist-to delineate their escalating autonomy and evolving responsibilities within the research lifecycle. We further identify pivotal challenges and future research trajectories such as robotic automation, self-improvement, and ethical governance. Overall, this survey provides a conceptual architecture and strategic foresight to navigate and shape the future of AI-driven scientific discovery, fostering both rapid innovation and responsible advancement. Github Repository: https://github.com/HKUST-KnowComp/Awesome-LLM-Scientific-Discovery.

artificial intelligence, large language model, natural language, (17 more...)

arXiv.org Artificial Intelligence

2505.13259

Country: North America > United States (0.28)

Genre:

Research Report (1.00)
Overview (1.00)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Scientific Discovery (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)

Add feedback

130-year-old butter bacteria discovered in Danish basement

Breakthroughs, discoveries, and DIY tips sent every weekday. For over a century, simple lactic acid bacteria has been one of the most reliable additives to keep food and drinks safe for over a century. It goes in butter, cheese, and other dairy products to help extend their shelf life. Now, a team in Denmark has uncovered some of the preservation aid's earliest examples. Their findings, published in the, only come after a chance discovery hidden away in the bowels of a university basement.

130-year-old butter bacteria, andrew paul, bacteria, (13 more...)

Popular Science

Country: Europe > Denmark > Capital Region > Copenhagen (0.06)

Genre: Research Report > New Finding (0.38)

Industry:

Consumer Products & Services > Food, Beverage, Tobacco & Cannabis (0.39)
Health & Medicine > Consumer Health (0.32)
Media > Photography (0.31)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Scientific Discovery (0.57)

Add feedback

Querying Climate Knowledge: Semantic Retrieval for Scientific Discovery

Adamu, Mustapha, Zhang, Qi, Pan, Huitong, Latecki, Longin Jan, Dragut, Eduard C.

arXiv.org Artificial IntelligenceSep-15-2025

The growing complexity and volume of climate science literature make it increasingly difficult for researchers to find relevant information across models, datasets, regions, and variables. This paper introduces a domain-specific Knowledge Graph (KG) built from climate publications and broader scientific texts, aimed at improving how climate knowledge is accessed and used. Unlike keyword based search, our KG supports structured, semantic queries that help researchers discover precise connections such as which models have been validated in specific regions or which datasets are commonly used with certain teleconnection patterns. We demonstrate how the KG answers such questions using Cypher queries, and outline its integration with large language models in RAG systems to improve transparency and reliability in climate-related question answering. This work moves beyond KG construction to show its real world value for climate researchers, model developers, and others who rely on accurate, contextual scientific information.

large language model, machine learning, natural language, (16 more...)

arXiv.org Artificial Intelligence

2509.10087

Country: North America > United States (0.97)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.92)
Information Technology > Communications > Web > Semantic Web (0.66)
Information Technology > Artificial Intelligence > Representation & Reasoning > Scientific Discovery (0.65)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.30)

Add feedback

What Are Research Hypotheses?

Wu, Jian, Rajtmajer, Sarah

arXiv.org Artificial IntelligenceSep-3-2025

Over the past decades, alongside advancements in natural language processing, significant attention has been paid to training models to automatically extract, understand, test, and generate hypotheses in open and scientific domains. However, interpretations of the term \emph{hypothesis} for various natural language understanding (NLU) tasks have migrated from traditional definitions in the natural, social, and formal sciences. Even within NLU, we observe differences defining hypotheses across literature. In this paper, we overview and delineate various definitions of hypothesis. Especially, we discern the nuances of definitions across recently published NLU tasks. We highlight the importance of well-structured and well-defined hypotheses, particularly as we move toward a machine-interpretable scholarly record.

artificial intelligence, large language model, natural language, (18 more...)

arXiv.org Artificial Intelligence

2509.00185

Country:

North America > United States (1.00)
Asia (0.93)
Europe > United Kingdom > England (0.28)

Genre: Research Report (1.00)

Industry: Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.48)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.47)
Information Technology > Artificial Intelligence > Representation & Reasoning > Scientific Discovery (0.47)

Add feedback

Nonzero-sum Adversarial Hypothesis Testing Games

Sarath Yasodharan, Patrick Loiseau

Neural Information Processing SystemsAug-20-2025, 09:04:21 GMT

We study nonzero-sum hypothesis testing games that arise in the context of adversarial classification, in both the Bayesian as well as the Neyman-Pearson frameworks. We first show that these games admit mixed strategy Nash equilibria, and then we examine some interesting concentration phenomena of these equilibria.

artificial intelligence, equilibrium, machine learning, (16 more...)

Neural Information Processing Systems

Industry:

Information Technology > Security & Privacy (1.00)
Leisure & Entertainment > Games (0.88)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Scientific Discovery (0.65)

Add feedback

Strategic Hypothesis Testing

Hossain, Safwan, Chen, Yatong, Chen, Yiling

arXiv.org Artificial IntelligenceAug-6-2025

We examine hypothesis testing within a principal-agent framework, where a strategic agent, holding private beliefs about the effectiveness of a product, submits data to a principal who decides on approval. The principal employs a hypothesis testing rule, aiming to pick a p-value threshold that balances false positives and false negatives while anticipating the agent's incentive to maximize expected profitability. Building on prior work, we develop a game-theoretic model that captures how the agent's participation and reporting behavior respond to the principal's statistical decision rule. Despite the complexity of the interaction, we show that the principal's errors exhibit clear monotonic behavior when segmented by an efficiently computable critical p-value threshold, leading to an interpretable characterization of their optimal p-value threshold. We empirically validate our model and these insights using publicly available data on drug approvals. Overall, our work offers a comprehensive perspective on strategic interactions within the hypothesis testing framework, providing technical and regulatory insights.

artificial intelligence, machine learning, scientific discovery, (19 more...)

arXiv.org Artificial Intelligence

2508.03289

Country: North America > United States (1.00)

Genre: Research Report > Experimental Study (1.00)

Industry:

Law (1.00)
Health & Medicine > Therapeutic Area (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Government > Regional Government > North America Government > United States Government > FDA (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Scientific Discovery (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Add feedback

RAISE: Enhancing Scientific Reasoning in LLMs via Step-by-Step Retrieval

Oh, Minhae, Kim, Jeonghye, Lee, Nakyung, Seo, Donggeon, Kim, Taeuk, Lee, Jungwoo

arXiv.org Artificial IntelligenceAug-5-2025

Scientific reasoning requires not only long-chain reasoning processes, but also knowledge of domain-specific terminologies and adaptation to updated findings. To deal with these challenges for scientific reasoning, we introduce RAISE, a step-by-step retrieval-augmented framework which retrieves logically relevant documents from in-the-wild corpus. RAISE is divided into three steps: problem decomposition, logical query generation, and logical retrieval. We observe that RAISE consistently outperforms other baselines on scientific reasoning benchmarks. We analyze that unlike other baselines, RAISE retrieves documents that are not only similar in terms of the domain knowledge, but also documents logically more relevant.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2506.08625

Country: Asia (0.46)

Genre:

Research Report (0.82)
Workflow (0.69)

Industry: Education (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Scientific Discovery (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Splits! A Flexible Dataset and Evaluation Framework for Sociocultural Linguistic Investigation

Caplan, Eylon, Chakraborty, Tania, Goldwasser, Dan

arXiv.org Artificial IntelligenceAug-1-2025

Variation in language use, shaped by speakers' sociocultural background and specific context of use, offers a rich lens into cultural perspectives, values, and opinions. However, the computational study of these Sociocultural Linguistic Phenomena (SLP) has often been limited to bespoke analyses of specific groups or topics, hindering the pace of scientific discovery. To address this, we introduce Splits!, a 9.7 million-post dataset from Reddit designed for systematic and flexible research. The dataset contains posts from over 53,000 users across 6 demographic groups, organized into 89 discussion topics to enable comparative analysis. We validate Splits! via self-identification and by successfully replicating several known SLPs from existing literature. We complement this dataset with a framework that leverages efficient retrieval methods to rapidly validate potential SLPs (PSLPs) by automatically evaluating whether a given hypothesis is supported by our data. Crucially, to distinguish between novel and obvious insights, the framework incorporates a human-validated measure of a hypothesis's ``unexpectedness.'' We demonstrate that the two-stage process reduces the number of statistically significant findings requiring manual inspection by a factor of 1.5-1.8x, streamlining the discovery of promising phenomena for further investigation.

computational linguistic, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2504.0464

Country:

Europe (1.00)
Asia (1.00)
North America > United States > Texas (0.28)

Genre: Research Report (1.00)

Industry:

Media (1.00)
Leisure & Entertainment (1.00)
Health & Medicine > Therapeutic Area (1.00)
Government (0.93)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Scientific Discovery (0.34)

Add feedback

HIAL: A New Paradigm for Hypergraph Active Learning via Influence Maximization

Hou, Yanheng, Li, Xunkai, Li, Zhenjun, Zhou, Bing, Li, Ronghua, Wang, Guoren

arXiv.org Artificial IntelligenceJul-29-2025

In recent years, Hypergraph Neural Networks (HNNs) have demonstrated immense potential in handling complex systems with high-order interactions. However, acquiring large-scale, high-quality labeled data for these models is costly, making Active Learning (AL) a critical technique. Existing Graph Active Learning (GAL) methods, when applied to hypergraphs, often rely on techniques like "clique expansion," which destroys the high-order structural information crucial to a hypergraph's success, thereby leading to suboptimal performance. To address this challenge, we introduce HIAL (Hypergraph Active Learning), a native active learning framework designed specifically for hypergraphs. We innovatively reformulate the Hypergraph Active Learning (HAL) problem as an Influence Maximization task. The core of HIAL is a dual-perspective influence function that, based on our novel "High-Order Interaction-Aware (HOI-Aware)" propagation mechanism, synergistically evaluates a node's feature-space coverage (via Magnitude of Influence, MoI) and its topological influence (via Expected Diffusion Value, EDV). We prove that this objective function is monotone and submodular, thus enabling the use of an efficient greedy algorithm with a formal (1-1/e) approximation guarantee. Extensive experiments on seven public datasets demonstrate that HIAL significantly outperforms state-of-the-art baselines in terms of performance, efficiency, generality, and robustness, establishing an efficient and powerful new paradigm for active learning on hypergraphs.

artificial intelligence, hypergraph, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2507.2049

Country:

Asia > China (0.29)
North America > United States (0.28)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Scientific Discovery (0.60)
Information Technology > Artificial Intelligence > Cognitive Science > Creativity & Intelligence (0.60)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.50)

Add feedback

1,000-year-old medieval sword emerges from Dutch river after chance discovery: 'Barely corroded'

FOX NewsJul-7-2025, 08:00:17 GMT

SOLVA Archaeology Service in Belgium announced the recent discovery of ancient Roman artifacts and remains, including a well-preserved dog, in Velzeke. A remarkable medieval sword with rare symbols was recently put on display in a Dutch museum, over a year after it was found by construction workers unexpectedly. The discovery of the sword was announced by the Netherlands' National Museum of Antiquities (RMO) in Leiden on June 24. The artifact, named the Linschoten Sword, was found in March 2024 during "maintenance dredging activities," the museum said in a press release. Construction workers were struck by a "long piece of iron" while cleaning a small river known as the Korte Linschoten, the statement noted.

000-year-old medieval sword emerge, discovery, river, (12 more...)

FOX News

Country:

Europe > Netherlands > South Holland > Leiden (0.25)
South America > Brazil (0.05)
Europe > Belgium > Flanders (0.05)

Technology:

Information Technology > Data Science > Data Mining (0.40)
Information Technology > Artificial Intelligence > Representation & Reasoning > Scientific Discovery (0.40)

Add feedback