AITopics

The chemist begins by thoroughly analyzing the molecular structure--recognizing the core scaffold, functional groups, stereochemical configurations, and the relative positions of these structural elements. With this understanding, the chemist reasons about potential modifications, such as substituting functional groups, adjusting ring systems, or altering stereochemistry, to improve the target property. Finally, guided by precise modification instructions, the chemist applies these changes to generate an optimized molecule (illustrated in Figure 1a). In another scenario, a chemist may need to design a new (de novo) molecule to satisfy a set of property requirements. This process involves reasoning about possible molecular structures that meet the design constraints and then synthesizing a novel compound based on the resulting detailed structural specification (shown in Figure 1b). Recent advancements in artificial intelligence (AI) provide promising opportunities to assist chemists in these complex workflows. While traditional molecular models, based on graph or sequence representations such as SMILES (Weininger, 1988), have demonstrated strong performance in domain-specific tasks including molecular property prediction (Huang et al., 2021), reaction prediction (Lowe, 2017), and conditional generation (Brown et al., 2019), they operate exclusively within the molecular modality and lack the ability to interpret or execute natural language instructions. Meanwhile, large language models (LLMs) have exhibited remarkable reasoning capabilities (OpenAI, 2025d), showing promise in tackling scientific problems that traditionally require Clemson University.

large language model, machine learning, natural language, (17 more...)

2505.15054

Genre:

Research Report (1.00)
Workflow (0.68)

Industry:

Law (0.54)
Materials > Chemicals (0.46)
Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Hanson, Nathaniel, Allison, Austin, DiMarzio, Charles, Padır, Taşkın, Dorsey, Kristen L.

SCANS: A Soft Gripper with Curvature and Spectroscopy Sensors for In-Hand Material Differentiation

We introduce the soft curvature and spectroscopy (SCANS) system: a versatile, electronics-free, fluidically actuated soft manipulator capable of assessing the spectral properties of objects either in hand or through pre-touch caging. This platform offers a wider spectral sensing capability than previous soft robotic counterparts. We perform a material analysis to explore optimal soft substrates for spectral sensing, and evaluate both pre-touch and in-hand performance. Experiments demonstrate explainable, statistical separation across diverse object classes and sizes (metal, wood, plastic, organic, paper, foam), with large spectral angle differences between items. Through linear discriminant analysis, we show that sensitivity in the near-infrared wavelengths is critical to distinguishing visually similar objects. These capabilities advance the potential of optics as a multi-functional sensory modality for soft robots. The complete parts list, assembly guidelines, and processing code for the SCANS gripper are accessible at: https://parses-lab.github.io/scans/.

artificial intelligence, machine learning, waveguide, (18 more...)

2510.02164

Country: North America > United States > Massachusetts (0.46)

Genre: Research Report (0.64)

Industry: Materials > Chemicals > Commodity Chemicals (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Catalyst GFlowNet for electrocatalyst design: A hydrogen evolution reaction case study

Podina, Lena, Humer, Christina, Duval, Alexandre, Schmidt, Victor, Ramlaoui, Ali, Chatterjee, Shahana, Bengio, Yoshua, Hernandez-Garcia, Alex, Rolnick, David, Therrien, Félix

Efficient and inexpensive energy storage is essential for accelerating the adoption of renewable energy and ensuring a stable supply, despite fluctuations in sources such as wind and solar. Electrocatalysts play a key role in hydrogen energy storage (HES), allowing the energy to be stored as hydrogen. However, the development of affordable and high-performance catalysts for this process remains a significant challenge. We introduce Catalyst GFlowNet, a generative model that leverages machine learning-based predictors of formation and adsorption energy to design crystal surfaces that act as efficient catalysts. We demonstrate the performance of the model through a proof-of-concept application to the hydrogen evolution reaction, a key reaction in HES, for which we successfully identified platinum as the most efficient known catalyst. In future work, we aim to extend this approach to the oxygen evolution reaction, where current optimal catalysts are expensive metal oxides, and open the search space to discover new materials. This generative modeling framework offers a promising pathway for accelerating the search for novel and efficient catalysts.

artificial intelligence, catalyst, machine learning, (15 more...)

2510.02142

Country: North America > Canada > Quebec (0.15)

Genre: Research Report (0.50)

Industry:

Materials > Chemicals > Specialty Chemicals (1.00)
Energy > Renewable > Hydrogen (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.90)

Parys, Paweł, Vaidya, Sairam, Berg-Kirkpatrick, Taylor, D'Antoni, Loris

Constrained Adaptive Rejection Sampling

Language Models (LMs) are increasingly used in applications where generated outputs must satisfy strict semantic or syntactic constraints. Existing approaches to constrained generation fall along a spectrum: greedy constrained decoding methods enforce validity during decoding but distort the LM's distribution, while rejection sampling (RS) preserves fidelity but wastes computation by discarding invalid outputs. Both extremes are problematic in domains such as program fuzzing, where both validity and diversity of samples are essential. We present Constrained Adaptive Rejection Sampling (CARS), an approach that strictly improves the sample-efficiency of RS without distributional distortion. CARS begins with unconstrained LM sampling and adaptively rules out constraint-violating continuations by recording them in a trie and subtracting their probability mass from future draws. This adaptive pruning ensures that prefixes proven invalid are never revisited, acceptance rates improve monotonically, and the resulting samples exactly follow the constrained distribution. In experiments on a variety of domains -- e.g., program fuzzing and molecular generation -- CARS consistently achieves higher efficiency -- measured in the number of LM forward passes per valid sample -- while also producing stronger sample diversity than both GCD and methods that approximate the LM's distribution.

large language model, machine learning, natural language, (17 more...)

2510.01902

Country: North America > United States (0.68)

Genre: Research Report (1.00)

Industry: Materials > Chemicals > Commodity Chemicals (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.93)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.70)
Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (0.66)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Hongzhou Lin, Julien Mairal, Zaid Harchaoui

A Universal Catalyst for First-Order Optimization

Neural Information Processing SystemsOct-2-2025, 13:23:38 GMT

Neural Information Processing Systems http://nips.cc/

algorithm, convergence rate, objective, (16 more...)

Neural Information Processing Systems

Industry: Materials > Chemicals > Specialty Chemicals (0.43)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.47)

arXiv.org Artificial IntelligenceOct-2-2025

Benchmarking Agentic Systems in Automated Scientific Information Extraction with ChemX

Vepreva, Anastasia, Razlivina, Julia, Eremeeva, Maria, Gubina, Nina, Orlova, Anastasia, Dmitrenko, Aleksei, Kapranova, Ksenya, Jyakhwo, Susan, Vasilev, Nikita, Sarkisyan, Arsen, Chernyshov, Ivan Yu., Vinogradov, Vladimir, Dmitrenko, Andrei

The emergence of agent-based systems represents a significant advancement in artificial intelligence, with growing applications in automated data extraction. However, chemical information extraction remains a formidable challenge due to the inherent heterogeneity of chemical data. Current agent-based approaches, both general-purpose and domain-specific, exhibit limited performance in this domain. To address this gap, we present ChemX, a comprehensive collection of 10 manually curated and domain-expert-validated datasets focusing on nanomaterials and small molecules. These datasets are designed to rigorously evaluate and enhance automated extraction methodologies in chemistry. To demonstrate their utility, we conduct an extensive benchmarking study comparing existing state-of-the-art agentic systems such as ChatGPT Agent and chemical-specific data extraction agents. Additionally, we introduce our own single-agent approach that enables precise control over document preprocessing prior to extraction. We further evaluate the performance of modern baselines, such as GPT-5 and GPT-5 Thinking, to compare their capabilities with agentic approaches. Our empirical findings reveal persistent challenges in chemical information extraction, particularly in processing domain-specific terminology, complex tabular and schematic representations, and context-dependent ambiguities. The ChemX benchmark serves as a critical resource for advancing automated information extraction in chemistry, challenging the generalization capabilities of existing methods, and providing valuable insights into effective evaluation strategies.

data mining, large language model, machine learning, (21 more...)

2510.00795

Country:

Europe > Switzerland (0.28)
Europe > Russia (0.28)

Genre: Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Materials > Chemicals (0.93)
Health & Medicine > Therapeutic Area (0.68)

Technology:

Information Technology > Data Science > Data Mining > Text Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(2 more...)

Brandon Anderson, Truong Son Hy, Risi Kondor

Cormorant: Covariant Molecular Neural Networks

Neural Information Processing SystemsOct-1-2025, 23:27:38 GMT

We propose Cormorant, a rotationally covariant neural network architecture for learning the behavior and properties of complex many-body physical systems. We apply these networks to molecular systems with two goals: learning atomic potential energy surfaces for use in Molecular Dynamics simulations, and learning ground state properties of molecules calculated by Density Functional Theory. Some of the key features of our network are that (a) each neuron explicitly corresponds to a subset of atoms; (b) the activation of each neuron is covariant to rotations, ensuring that overall the network is fully rotationally invariant. Furthermore, the non-linearity in our network is based upon tensor products and the Clebsch-Gordan decomposition, allowing the network to operate entirely in Fourier space. Cormorant significantly outperforms competing algorithms in learning molecular Potential Energy Surfaces from conformational geometries in the MD-17 dataset, and is competitive with other methods at learning geometric, energetic, electronic, and thermodynamic properties of molecules on the GDB-9 dataset.

artificial intelligence, interaction, machine learning, (17 more...)

Neural Information Processing Systems

Genre: Research Report (0.47)

Industry:

Materials > Chemicals (0.46)
Energy (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Neural Information Processing SystemsOct-1-2025, 22:12:57 GMT

Learning Composable Energy Surrogates for PDE Order Reduction

To address this, we leverage parametric modular structure to learn component-level surrogates, enabling cheaper high-fidelity simulation. We use a neural network to model the stored potential energy in a component given boundary conditions.

artificial intelligence, displacement, machine learning, (17 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Jordan (0.05)
North America > Canada (0.04)

Industry:

Materials (0.46)
Energy (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

RobohubOct-1-2025, 09:36:22 GMT

Rethinking how robots move: Light and AI drive precise motion in soft robotic arm

Researchers at Rice University have developed a soft robotic arm capable of performing complex tasks such as navigating around an obstacle or hitting a ball, guided and powered remotely by laser beams without any onboard electronics or wiring. The research could inform new ways to control implantable surgical devices or industrial machines that need to handle delicate objects. In a proof-of-concept study that integrates smart materials, machine learning and an optical control system, a team of Rice researchers led by materials scientist Hanyu Zhu used a light-patterning device to precisely induce motion in a robotic arm made from azobenzene liquid crystal elastomer a type of polymer that responds to light. According to the study published in Advanced Intelligent Systems, the new robotic system incorporates a neural network trained to predict the exact light pattern needed to create specific arm movements. This makes it easier for the robot to execute complex tasks without needing similarly complex input from an operator.

ai drive precise motion, robotic arm, soft robotic arm, (11 more...)

Robohub

Country:

Europe > France (0.06)
North America > United States > Colorado (0.05)
Europe > Switzerland > Zürich > Zürich (0.05)

Genre: Research Report (0.55)

Industry:

Materials (0.78)
Leisure & Entertainment > Sports > Soccer (0.32)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.36)

Du, Angel Pan, Arana-Catania, Miguel, Gutiérrez, Enric Grustan

AI-assisted Advanced Propellant Development for Electric Propulsion

arXiv.org Artificial IntelligenceOct-1-2025

Artificial Intelligence algorithms are introduced in this work as a tool to predict the performance of new chemical compounds as alternative propellants for electric propulsion, focusing on predicting their ionisation characteristics and fragmentation patterns. The chemical properties and structure of the compounds are encoded using a chemical fingerprint, and the training datasets are extracted from the NIST WebBook. The AI-predicted ionisation energy and minimum appearance energy have a mean relative error of 6.87% and 7.99%, respectively, and a predicted ion mass with a 23.89% relative error. In the cases of full mass spectra due to electron ionisation, the predictions have a cosine similarity of 0.6395 and align with the top 10 most similar mass spectra in 78% of instances within a 30 Da range.

artificial intelligence, machine learning, prediction, (18 more...)

2509.26567

Country:

Europe (0.68)
North America > United States > New Jersey (0.14)

Genre: Research Report (0.82)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (0.68)
Materials > Chemicals (0.49)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)