AITopics | chemistry

Collaborating Authors

chemistry

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Nobel Prize winner leaving UC Berkeley for new role in China

Los Angeles TimesJul-10-2026, 17:06:18 GMT

Things to Do in L.A. Tap to enable a layout that focuses on the article. Omar Yaghi, professor at the University of California, Berkeley, speaks during a media conference in Brussels, Oct. 8, 2025, after being one of three scientists awarded the Nobel Prize in chemistry. This is read by an automated voice. Please report any issues or inconsistencies here . See more from the L.A. Times in Google Search.

artificial intelligence, housing & homelessness politics science, social media, (10 more...)

Los Angeles Times

Country:

Asia (1.00)
North America > United States > California > Alameda County > Berkeley (0.25)

Genre: Personal > Honors > Award (1.00)

Industry:

Media > News (0.71)
Information Technology > Services (0.68)
Education > Educational Setting > Higher Education (0.52)
(2 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence (1.00)

Add feedback

Training a Scientific Reasoning Model for Chemistry

Neural Information Processing SystemsJun-23-2026, 00:43:56 GMT

Reasoning models are large language models that emit a long chain-of-thought before answering, providing both higher accuracy and explicit reasoning for their response. A major question has been whether language model reasoning generalizes beyond mathematics, programming, and logic, where most previous work has focused. We demonstrate that reasoning models can be post-trained for chemistry without additional domain pretraining, and require substantially less data compared to contemporary domain-specific models. We report ether0, a 24B parameter LLM (based on Mistral-Small-24B) that can reason in natural language and respond with chemical structures. This reasoning model was trained with reinforcement learning on 640,730 experimentally-grounded chemistry problems across 375 tasks ranging from synthesizability, to blood-brain barrier permeability, to human receptor activity, to scent. Our model exceeds general-purpose chemistry models, frontier models, and human experts on molecular design tasks. It is also more data efficient relative to specialized models. We anticipate that this method can be applied to train data-efficient language models specialized for tasks across a wide variety of scientific domains.

large language model, machine learning, natural language, (20 more...)

Neural Information Processing Systems

Country: North America > United States (0.67)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.67)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

MolVision: Molecular Property Prediction with Vision Language Models

Neural Information Processing SystemsJun-22-2026, 17:05:01 GMT

Molecular property prediction is a fundamental task in computational chemistry with critical applications in drug discovery and materials science. While recent works have explored Large Language Models (LLMs) for this task, they primarily rely on textual molecular representations such as SMILES/SELFIES, which can be ambiguous and structurally less informative. In this work, we introduce MolVision, a novel approach that leverages Vision-Language Models (VLMs) by integrating both molecular structure as images and textual descriptions to enhance property prediction. We construct a benchmark spanning ten diverse datasets, covering classification, regression and description tasks. Evaluating nine different VLMs in zero-shot, few-shot, and fine-tuned settings, we find that visual information improves prediction performance, particularly when combined with efficient fine-tuning strategies such as LoRA. Our results reveal that while visual information alone is insufficient, multimodal fusion significantly enhances generalization across molecular properties. Adaptation of vision encoder for molecular images in conjunction with LoRA further improves the performance.

large language model, machine learning, natural language, (20 more...)

Neural Information Processing Systems

Country: North America > United States (0.46)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.87)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.69)
Health & Medicine > Therapeutic Area > Immunology (0.69)
Health & Medicine > Pharmaceuticals & Biotechnology (0.66)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.67)

Add feedback

Bridging the Gap Between Cross-Domain Theory and Practical Application: ACase Study on Molecular Dissolution

Neural Information Processing SystemsJun-22-2026, 10:33:33 GMT

Artificial intelligence (AI) has played a transformative role in chemical research, greatly facilitating the prediction of small molecule properties, simulation of catalytic processes, and material design. These advances are driven by increases in computing power, open source machine learning frameworks, and extensive chemical datasets. However, a persistent challenge is the limited amount of high-quality real-world data, while models calculated based on large amounts of theoretical data are often costly and difficult to deploy, which hinders the applicability of AI models in practical scenarios. In this study, we enhance the prediction of solutesolvent properties by proposing a novel sample selection method: Core Subset Iterative Extraction (CSIE). CSIE iteratively updates the core sample subset based on information gain to remove redundant samples in theoretical data and optimize the performance of the model on real chemical datasets. Furthermore, we introduce an asymmetric molecular interaction graph neural network (AMGNN) that combines positional information and bidirectional edge connections to simulate real-world chemical reaction scenarios to better capture solute-solvent interactions. Experimental results show that our method can accurately extract the core subset and improve the prediction accuracy. Code is available at: https://CISE-AMGNN.

artificial intelligence, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry: Materials > Chemicals > Commodity Chemicals > Petrochemicals (0.90)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

ChemX: ACollection of Chemistry Datasets for Benchmarking Automated Information Extraction

Neural Information Processing SystemsJun-20-2026, 17:38:28 GMT

Despite recent advances in machine learning, many scientific discoveries in chemistry still rely on manually curated datasets extracted from the scientific literature. Automation of information extraction in specialized chemistry domains has the potential to scale up machine learning applications and improve the quality of predictions, enabling data-driven scientific discoveries at a faster pace. In this paper, we present ChemX, a collection of 10 benchmarking datasets across several domains of chemistry providing a reliable basis for evaluating and fine-tuning automated information extraction methods. The datasets encompassing various properties of small molecules and nanomaterials have been manually extracted from peer-reviewed publications and systematically validated by domain experts through a cross-verification procedure allowing for identification and correction of errors at sources. In order to demonstrate the utility of the resulting datasets, we evaluate the extraction performance of the state-of-the-art large language models (LLMs). Moreover, we design our own agentic approach to take full control of the document preprocessing before LLM-based information extraction.

data mining, large language model, machine learning, (21 more...)

Neural Information Processing Systems

Country: Europe > Russia (0.28)

Genre: Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.68)
Materials > Chemicals > Commodity Chemicals > Petrochemicals (0.68)
Energy (0.67)

Technology:

Information Technology > Data Science > Data Mining > Text Mining (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

The Catechol Benchmark: Time-series Solvent Selection Data for Few-shot Machine Learning

Neural Information Processing SystemsJun-13-2026, 20:21:26 GMT

Machine learning has promised to change the landscape of laboratory chemistry, with impressive results in molecular property prediction and reaction retro-synthesis. However, chemical datasets are often inaccessible to the machine learning community as they tend to require cleaning, thorough understanding of the chemistry, or are simply not available. In this paper, we introduce a novel dataset for yield prediction, providing the first-ever transient flow dataset for machine learning benchmarking, covering over 1200 process conditions. While previous datasets focus on discrete parameters, our experimental set-up allow us to sample a large number of continuous process conditions, generating new challenges for machine learning models. We focus on solvent selection, a task that is particularly difficult to model theoretically and therefore ripe for machine learning applications.

artificial intelligence, machine learning, proceedings, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

ChemX: A Collection of Chemistry Datasets for Benchmarking Automated Information Extraction

Neural Information Processing SystemsJun-13-2026, 13:13:29 GMT

information extraction, large language model, natural language, (8 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.81)

Add feedback

The race to solve the biggest problem in quantum computing

New ScientistMar-13-2026, 07:00:40 GMT

The errors that quantum computers make are holding the technology back. Quantum computers won't be truly useful until they can correct their mistakes Quantum computers are already here, but they make far too many errors. This is arguably the biggest obstacle to the technology really becoming useful, but recent breakthroughs suggest a solution may be on the horizon. Errors creep into traditional computers too, but there are well-established techniques for correcting them. They rely on redundancy, where extra bits are used to detect when 0s incorrectly swap to 1s or vice versa.

artificial intelligence, quantum computer, social media, (15 more...)

New Scientist

Country:

North America > United States (0.05)
Asia > China (0.05)

Industry:

Marketing (0.43)
Health & Medicine > Therapeutic Area (0.31)

Technology:

Information Technology > Hardware (1.00)
Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence (0.92)

Add feedback

Can LLMs Solve Molecule Puzzles A Benchmark for Molecular Structure Elucidation

Neural Information Processing SystemsFeb-18-2026, 16:45:01 GMT

LLMs with improved chemical reasoning capabilities in the other two sub-tasks.

large language model, machine learning, natural language, (20 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Education (0.92)
Materials > Chemicals (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

The science of soulmates: Is there someone out there exactly right for you?

BBC NewsFeb-13-2026, 00:01:47 GMT

The science of soulmates: Is there someone out there exactly right for you? On Valentine's Day, there's the temptation to believe that somewhere out there is The One: a soulmate, a perfect match, the person you were meant to be with. Across history, humans have always been drawn to the idea that love isn't random. In ancient Greece, Plato imagined that we were once whole beings with four arms, four legs and two faces, so radiant that Zeus split us in two; ever since, each half has roamed the earth searching for its missing other, a myth that gives the modern soulmate its poetic pedigree and the promise that somewhere, someone will finally make us feel complete. In the Middle Ages, troubadours and Arthurian tales recast that longing as courtly love, a fierce, often forbidden devotion like Lancelot's for Guinevere, in which a knight proved his worth through self-sacrifice for a beloved he might never openly declare.

artificial intelligence, soulmate, university, (17 more...)

BBC News

Country:

Europe (1.00)
North America > United States > Utah (0.14)
North America > United States > Tennessee (0.14)

Genre: Research Report (0.47)

Industry: Leisure & Entertainment (0.70)

Technology: Information Technology > Artificial Intelligence (0.47)

Add feedback