Collaborating Authors: Rittig, Jan G.


Predicting the Temperature-Dependent CMC of Surfactant Mixtures with Graph Neural Networks

arXiv.org Artificial Intelligence

Surfactants are key ingredients in foaming and cleansing products across various industries, such as personal and home care and industrial cleaning, with the critical micelle concentration (CMC) being of major interest. Predictive models for the CMC of pure surfactants have been developed based on recent ML methods; in practice, however, surfactant mixtures are typically used for performance, environmental, and cost reasons. This requires accounting for synergistic/antagonistic interactions between surfactants, yet predictive ML models covering a wide spectrum of mixtures are still missing. Herein, we develop a graph neural network (GNN) framework for surfactant mixtures to predict the temperature-dependent CMC. We collect data for 108 binary surfactant mixtures, to which we add data for pure species from our previous work [Brozos et al. (2024), J. Chem. Theory Comput.]. We then develop and train GNNs and evaluate their accuracy across different prediction test scenarios for binary mixtures relevant to practical applications. The final GNN models demonstrate very high predictive performance when interpolating between different mixture compositions and for new binary mixtures of known species. Extrapolation to binary surfactant mixtures in which one or both surfactant species are unseen yields accurate results for the majority of surfactant systems. We further find that the GNN outperforms a semi-empirical model based on activity coefficients, which has been widely used to date. We then explore whether GNN models trained solely on binary mixture and pure species data can also accurately predict the CMCs of ternary mixtures. Finally, we experimentally measure the CMC of four commercial surfactants, which are industrially relevant mixtures containing up to four species, and find very good agreement between measured and predicted CMC values.
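The abstract does not spell out how composition and temperature enter the model; the sketch below is one plausible mixture readout, assuming each surfactant has already been encoded into a fingerprint vector by a GNN. The `MixtureCMCHead` class, the mole-fraction weighting, and all dimensions are illustrative assumptions, not the architecture reported in the paper.

```python
import torch
import torch.nn as nn

class MixtureCMCHead(nn.Module):
    """Hypothetical readout: combine per-surfactant GNN fingerprints of a
    binary mixture with mole fractions and temperature to predict log(CMC)."""

    def __init__(self, fp_dim: int = 64, hidden: int = 128):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(fp_dim + 1, hidden),  # +1 input for temperature
            nn.ReLU(),
            nn.Linear(hidden, 1),
        )

    def forward(self, fp_a, fp_b, x_a, temperature):
        # Mole-fraction-weighted mixture fingerprint: permutation-invariant
        # in the two species and reduces to the pure component at x_a = 1.
        x_b = 1.0 - x_a
        fp_mix = x_a.unsqueeze(-1) * fp_a + x_b.unsqueeze(-1) * fp_b
        inp = torch.cat([fp_mix, temperature.unsqueeze(-1)], dim=-1)
        return self.mlp(inp).squeeze(-1)  # predicted log(CMC)

# Usage with random stand-ins for the GNN fingerprints of the two surfactants
head = MixtureCMCHead()
fp_a, fp_b = torch.randn(8, 64), torch.randn(8, 64)
x_a = torch.rand(8)            # mole fraction of surfactant A
T = torch.full((8,), 298.15)   # temperature in K
log_cmc = head(fp_a, fp_b, x_a, T)
```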


GraphXForm: Graph transformer for computer-aided molecular design with application to extraction

arXiv.org Artificial Intelligence

Generative deep learning has become pivotal in molecular design for drug discovery and materials science. A widely used paradigm is to pretrain neural networks on string representations of molecules and fine-tune them using reinforcement learning on specific objectives. However, string-based models face challenges in ensuring chemical validity and enforcing structural constraints like the presence of specific substructures. We propose to instead combine graph-based molecular representations, which can naturally ensure chemical validity, with transformer architectures, which are highly expressive and capable of modeling long-range dependencies between atoms. Our approach iteratively modifies a molecular graph by adding atoms and bonds, which ensures chemical validity and facilitates the incorporation of structural constraints. We present GraphXForm, a decoder-only graph transformer architecture, which is pretrained on existing compounds and then fine-tuned using a new training algorithm that combines elements of the deep cross-entropy method with self-improvement learning from language modeling, allowing stable fine-tuning of deep transformers with many layers. We evaluate GraphXForm on two solvent design tasks for liquid-liquid extraction, showing that it outperforms four state-of-the-art molecular design techniques, while it can flexibly enforce structural constraints or initiate the design from existing molecular structures.
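As a rough illustration of the "iteratively modify a molecular graph" idea, the snippet below applies a single graph-editing action and keeps it only if RDKit can still sanitize the result; the action encoding and the `apply_action` helper are hypothetical, and the transformer policy that would propose such actions is not shown.

```python
from rdkit import Chem

def apply_action(mol: Chem.RWMol, action: tuple):
    """Apply one graph-editing action ('add_atom' or 'add_bond') and return
    the new molecule only if it is still chemically valid, else None."""
    trial = Chem.RWMol(mol)  # edit a copy so the current state is preserved
    kind = action[0]
    if kind == "add_atom":
        _, symbol, neighbor = action
        idx = trial.AddAtom(Chem.Atom(symbol))
        trial.AddBond(neighbor, idx, Chem.BondType.SINGLE)
    elif kind == "add_bond":
        _, i, j = action
        trial.AddBond(i, j, Chem.BondType.SINGLE)
    try:
        Chem.SanitizeMol(trial)  # valence / aromaticity checks
    except Exception:
        return None              # action would break chemical validity
    return trial

# Grow ethane into propane by attaching one carbon to atom 1
mol = Chem.RWMol(Chem.MolFromSmiles("CC"))
mol = apply_action(mol, ("add_atom", "C", 1))
print(Chem.MolToSmiles(mol))  # CCC
```

Because every accepted state passes sanitization, the design never leaves the space of valid molecules, which is the key difference to string-based generation.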


Graph Neural Networks for Surfactant Multi-Property Prediction

arXiv.org Artificial Intelligence

Surfactants are of high importance in different industrial sectors such as cosmetics, detergents, oil recovery, and drug delivery systems. Therefore, many quantitative structure-property relationship (QSPR) models have been developed for surfactants. Each predictive model typically focuses on one surfactant class, mostly nonionics. Graph Neural Networks (GNNs) have exhibited great predictive performance for property prediction of ionic liquids, polymers, and drugs in general. Specifically for surfactants, GNNs can successfully predict the critical micelle concentration (CMC), a key surfactant property associated with micellization. A key factor in the predictive ability of QSPR and GNN models is the data available for training. Based on an extensive literature search, we create the largest available CMC database, with 429 molecules, and the first large data collection for the surface excess concentration ($\Gamma_m$), another surfactant property associated with foaming, with 164 molecules. We then develop GNN models to predict the CMC and $\Gamma_m$ and explore different learning approaches, i.e., single- and multi-task learning, as well as different training strategies, namely ensemble and transfer learning. We find that a multi-task GNN with ensemble learning trained on all $\Gamma_m$ and CMC data performs best. Finally, we test the ability of our CMC model to generalize to industrial-grade pure-component surfactants. The GNN yields highly accurate predictions for the CMC, showing great potential for future industrial applications.
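A minimal sketch of what a multi-task setup over partially labelled data can look like, assuming one shared GNN fingerprint (encoder not shown) feeding two property heads for CMC and $\Gamma_m$; the class and function names are illustrative, and the ensemble and transfer-learning strategies are omitted.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MultiTaskHead(nn.Module):
    """Hypothetical multi-task readout: one shared molecular fingerprint
    feeds two separate property heads (CMC and surface excess Gamma_m)."""

    def __init__(self, fp_dim: int = 64, hidden: int = 128):
        super().__init__()
        self.cmc_head = nn.Sequential(nn.Linear(fp_dim, hidden), nn.ReLU(), nn.Linear(hidden, 1))
        self.gamma_head = nn.Sequential(nn.Linear(fp_dim, hidden), nn.ReLU(), nn.Linear(hidden, 1))

    def forward(self, fp):
        return self.cmc_head(fp).squeeze(-1), self.gamma_head(fp).squeeze(-1)

def masked_multitask_loss(pred_cmc, pred_gamma, y_cmc, y_gamma, has_cmc, has_gamma):
    """Only labelled targets contribute, so molecules with a single known
    property can still be used for training."""
    loss = torch.tensor(0.0)
    if has_cmc.any():
        loss = loss + F.mse_loss(pred_cmc[has_cmc], y_cmc[has_cmc])
    if has_gamma.any():
        loss = loss + F.mse_loss(pred_gamma[has_gamma], y_gamma[has_gamma])
    return loss
```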


Gibbs-Duhem-Informed Neural Networks for Binary Activity Coefficient Prediction

arXiv.org Artificial Intelligence

We propose Gibbs-Duhem-informed neural networks for the prediction of binary activity coefficients at varying compositions. That is, we include the Gibbs-Duhem equation explicitly in the loss function for training neural networks, which is straightforward in standard machine learning (ML) frameworks that enable automatic differentiation. In contrast to recent hybrid ML approaches, our approach does not rely on embedding a specific thermodynamic model inside the neural network and therefore avoids the corresponding prediction limitations. Rather, Gibbs-Duhem consistency serves as regularization, while the flexibility of the ML models is preserved. Our results show increased thermodynamic consistency and generalization capabilities for activity coefficient predictions by Gibbs-Duhem-informed graph neural networks and matrix completion methods. We also find that the model architecture, particularly the activation function, can have a strong influence on the prediction quality. The approach can easily be extended to account for other thermodynamic consistency conditions.
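A minimal sketch of how such a consistency penalty can be computed with automatic differentiation, here for a binary mixture at constant temperature and pressure, where $x_1 \, \partial \ln\gamma_1/\partial x_1 + x_2 \, \partial \ln\gamma_2/\partial x_1 = 0$. The toy model and its interface returning both log activity coefficients are assumptions for illustration, not the paper's architecture.

```python
import torch

class ToyGammaModel(torch.nn.Module):
    """Stand-in for a GNN or matrix completion model: maps (x1, T) to
    (ln gamma_1, ln gamma_2) for one binary system."""
    def __init__(self):
        super().__init__()
        self.net = torch.nn.Sequential(
            torch.nn.Linear(2, 32), torch.nn.Softplus(), torch.nn.Linear(32, 2))

    def forward(self, x1, temperature):
        out = self.net(torch.stack([x1, temperature], dim=-1))
        return out[..., 0], out[..., 1]

def gibbs_duhem_residual(model, x1, temperature):
    """Residual of x1 * d(ln g1)/dx1 + (1 - x1) * d(ln g2)/dx1, evaluated
    with autograd so it can be added to the training loss as regularization."""
    x1 = x1.clone().requires_grad_(True)
    ln_g1, ln_g2 = model(x1, temperature)
    dg1 = torch.autograd.grad(ln_g1.sum(), x1, create_graph=True)[0]
    dg2 = torch.autograd.grad(ln_g2.sum(), x1, create_graph=True)[0]
    return x1 * dg1 + (1.0 - x1) * dg2

model = ToyGammaModel()
x1 = torch.rand(16)                 # compositions
T = torch.full((16,), 298.15)       # temperature in K
# Total loss would be: data MSE + lambda * consistency penalty
penalty = gibbs_duhem_residual(model, x1, T).pow(2).mean()
```

Using `create_graph=True` keeps the derivative differentiable, so the penalty can be minimized jointly with the data loss by the same optimizer.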


Graph Machine Learning for Design of High-Octane Fuels

arXiv.org Artificial Intelligence

Fuels with high knock resistance enable modern spark-ignition engines to achieve high efficiency and thus low CO2 emissions. Identifying molecules with desired autoignition properties, indicated by a high research octane number and a high octane sensitivity, is therefore of great practical relevance and can be supported by computer-aided molecular design (CAMD). Recent developments in the field of graph machine learning (graph-ML) provide novel, promising tools for CAMD. We propose a modular graph-ML CAMD framework that integrates generative graph-ML models with graph neural networks and optimization, enabling the design of molecules with desired ignition properties in a continuous molecular space. In particular, we explore the potential of Bayesian optimization and genetic algorithms in combination with generative graph-ML models. The graph-ML CAMD framework successfully identifies well-established high-octane components. It also suggests new candidates, one of which we experimentally investigate and use to illustrate the need for further autoignition training data.
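To make the "optimization in a continuous molecular space" step concrete, here is a bare-bones genetic-algorithm generation over latent vectors; `decode`, `predict_ron`, and `predict_os` are hypothetical stand-ins for the generative model's decoder and the GNN property predictors, and the actual framework may look different, e.g. when Bayesian optimization is used instead.

```python
import numpy as np

def fitness(z, decode, predict_ron, predict_os):
    """Score a latent vector: decode it to a molecule, then combine the
    predicted research octane number (RON) and octane sensitivity (OS)."""
    smiles = decode(z)
    if smiles is None:          # decoder produced no valid molecule
        return -np.inf
    return predict_ron(smiles) + predict_os(smiles)

def genetic_generation(population, decode, predict_ron, predict_os,
                       n_parents=16, sigma=0.1, rng=None):
    """One generation: rank by fitness, keep the best, mutate them in latent space."""
    rng = rng or np.random.default_rng(0)
    scores = np.array([fitness(z, decode, predict_ron, predict_os) for z in population])
    parents = population[np.argsort(scores)[-n_parents:]]
    children = parents + sigma * rng.standard_normal(parents.shape)
    return np.concatenate([parents, children], axis=0)
```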


Physical Pooling Functions in Graph Neural Networks for Molecular Property Prediction

arXiv.org Artificial Intelligence

Graph neural networks (GNNs) are emerging in chemical engineering for the end-to-end learning of physicochemical properties based on molecular graphs. A key element of GNNs is the pooling function, which combines atom feature vectors into molecular fingerprints. Most previous works use a standard pooling function to predict a variety of properties. However, unsuitable pooling functions can lead to unphysical GNNs that generalize poorly. We compare and select meaningful GNN pooling methods based on physical knowledge about the learned properties. The impact of physical pooling functions is demonstrated with molecular properties calculated from quantum mechanical computations. We also compare our results to the recent set2set pooling approach. We recommend using sum pooling for the prediction of properties that depend on molecular size and compare pooling functions for properties that are molecular size-independent. Overall, we show that the use of physical pooling functions significantly enhances generalization.
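The distinction between size-dependent and size-independent properties maps directly onto sum versus mean pooling. A minimal sketch, assuming the usual GNN mini-batching convention of concatenated node features plus a graph-assignment vector (as used by common GNN libraries):

```python
import torch

def sum_pool(node_features, batch):
    """Sum pooling: the fingerprint grows with molecule size, which suits
    properties that depend on molecular size (extensive-like quantities)."""
    n_graphs = int(batch.max()) + 1
    out = torch.zeros(n_graphs, node_features.size(-1), dtype=node_features.dtype)
    return out.index_add(0, batch, node_features)

def mean_pool(node_features, batch):
    """Mean pooling: a size-independent fingerprint, a candidate for
    properties that do not scale with molecule size."""
    counts = torch.bincount(batch).clamp(min=1).unsqueeze(-1).to(node_features.dtype)
    return sum_pool(node_features, batch) / counts

# Two molecules in one batch: 3 atoms and 2 atoms
x = torch.randn(5, 8)                  # per-atom feature vectors
batch = torch.tensor([0, 0, 0, 1, 1])  # atom-to-molecule assignment
print(sum_pool(x, batch).shape, mean_pool(x, batch).shape)  # (2, 8) each
```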


Graph Neural Networks for Temperature-Dependent Activity Coefficient Prediction of Solutes in Ionic Liquids

arXiv.org Artificial Intelligence

Ionic liquids (ILs) are important solvents for sustainable processes, and predicting activity coefficients (ACs) of solutes in ILs is needed. Recently, matrix completion methods (MCMs), transformers, and graph neural networks (GNNs) have shown high accuracy in predicting ACs of binary mixtures, superior to well-established models, e.g., COSMO-RS and UNIFAC. GNNs are particularly promising here as they learn a molecular graph-to-property relationship without the pretraining typically required for transformers and are, unlike MCMs, applicable to molecules not included in training. For ILs, however, GNN applications are currently missing. Herein, we present a GNN to predict temperature-dependent infinite dilution ACs of solutes in ILs. We train the GNN on a database including more than 40,000 AC values and compare it to a state-of-the-art MCM. The GNN and MCM achieve similar high prediction performance, with the GNN additionally enabling high-quality predictions for ACs of solutions that contain ILs and solutes not considered during training.
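The abstract does not state how temperature enters the model; one common way to build temperature dependence into such a readout is to predict the coefficients of a simple $\ln\gamma^\infty(T) = A + B/T$ form from the solute and IL fingerprints. The class below is a hypothetical sketch under that assumption, not the paper's architecture.

```python
import torch
import torch.nn as nn

class InfiniteDilutionACHead(nn.Module):
    """Hypothetical readout: solute and IL fingerprints (from GNN encoders,
    not shown) are mapped to coefficients A, B of ln(gamma_inf) = A + B/T."""

    def __init__(self, fp_dim: int = 64, hidden: int = 128):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(2 * fp_dim, hidden), nn.ReLU(), nn.Linear(hidden, 2))

    def forward(self, fp_solute, fp_il, temperature):
        coeffs = self.mlp(torch.cat([fp_solute, fp_il], dim=-1))
        A, B = coeffs[..., 0], coeffs[..., 1]
        return A + B / temperature  # ln(gamma_inf) at the given T (in K)

# Usage with random stand-ins for the solute and IL fingerprints
head = InfiniteDilutionACHead()
ln_gamma = head(torch.randn(4, 64), torch.randn(4, 64), torch.full((4,), 333.15))
```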