Chmiela, Stefan
Euclidean Fast Attention: Machine Learning Global Atomic Representations at Linear Cost
Frank, J. Thorben, Chmiela, Stefan, Müller, Klaus-Robert, Unke, Oliver T.
Long-range correlations are essential across numerous machine learning tasks, especially for data embedded in Euclidean space, where the relative positions and orientations of distant components are often critical for accurate predictions. Self-attention offers a compelling mechanism for capturing these global effects, but its quadratic complexity presents a significant practical limitation. This problem is particularly pronounced in computational chemistry, where the stringent efficiency requirements of machine learning force fields (MLFFs) often preclude accurately modeling long-range interactions. To address this, we introduce Euclidean fast attention (EFA), a linear-scaling attention-like mechanism designed for Euclidean data, which can be easily incorporated into existing model architectures. At the core of EFA are novel Euclidean rotary positional encodings (ERoPE), which encode spatial information efficiently while respecting essential physical symmetries. We empirically demonstrate that EFA effectively captures diverse long-range effects, enabling EFA-equipped MLFFs to describe challenging chemical interactions for which conventional MLFFs yield incorrect results.
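To make the abstract's two ingredients concrete, the sketch below pairs a rotary-style positional encoding (feature pairs rotated by angles proportional to a projected atomic coordinate) with a kernelized linear-attention update whose cost grows linearly in the number of atoms. It is a minimal illustration only: the function names, projection axis, and exponential feature map are assumptions, and the block is not the paper's EFA/ERoPE formulation (in particular, it does not enforce the rotational symmetries ERoPE is designed to respect).

```python
import numpy as np

def rotary_encode(x, positions, freqs, direction):
    # Rotate consecutive feature pairs of x by angles proportional to the
    # projection of each atom's 3D position onto a chosen direction.
    # Hypothetical rotary-style encoding for illustration, not the paper's ERoPE.
    t = positions @ direction                    # (N,) scalar coordinate per atom
    angles = np.outer(t, freqs)                  # (N, d/2) rotation angles
    cos, sin = np.cos(angles), np.sin(angles)
    x1, x2 = x[:, 0::2], x[:, 1::2]              # split features into pairs
    return np.concatenate([x1 * cos - x2 * sin,  # 2D rotation of each pair
                           x1 * sin + x2 * cos], axis=-1)

def linear_attention(q, k, v):
    # O(N) attention: apply a positive feature map, then aggregate keys/values
    # once and reuse the aggregate for every query.
    q, k = np.exp(q - q.max()), np.exp(k - k.max())  # crude positive feature map
    kv = k.T @ v                                 # (d, d_v) summed key-value products
    z = q @ k.sum(axis=0)                        # per-query normalization
    return (q @ kv) / z[:, None]

# Toy usage: N atoms with random features and positions.
rng = np.random.default_rng(0)
N, d = 8, 16
feats, pos = rng.normal(size=(N, d)), rng.normal(size=(N, 3))
freqs = 1.0 / (10.0 ** (np.arange(d // 2) / (d // 2)))
direction = np.array([0.0, 0.0, 1.0])            # arbitrary projection axis
out = linear_attention(rotary_encode(feats, pos, freqs, direction),
                       rotary_encode(feats, pos, freqs, direction),
                       feats)                    # (N, d) globally mixed features
```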
From Peptides to Nanostructures: A Euclidean Transformer for Fast and Stable Machine Learned Force Fields
Frank, J. Thorben, Unke, Oliver T., Müller, Klaus-Robert, Chmiela, Stefan
Recent years have seen vast progress in the development of machine learned force fields (MLFFs) based on ab-initio reference calculations. Despite achieving low test errors, the suitability of MLFFs for molecular dynamics (MD) simulations is being increasingly scrutinized due to concerns about instability. Our findings suggest a potential connection between MD simulation stability and the presence of equivariant representations in MLFFs, but the computational cost of such representations can limit the practical advantages they would otherwise bring. To address this, we propose a transformer architecture called SO3krates that combines sparse equivariant representations (Euclidean variables) with a self-attention mechanism that can separate invariant and equivariant information, eliminating the need for expensive tensor products. SO3krates achieves a unique combination of accuracy, stability, and speed that enables insightful analysis of quantum properties of matter on unprecedented time and system size scales. To showcase this capability, we generate stable MD trajectories for flexible peptides and supra-molecular structures with hundreds of atoms. Furthermore, we investigate the potential energy surface (PES) topology of medium-sized chain-like molecules (e.g., small peptides) by exploring thousands of minima. Remarkably, SO3krates demonstrates the ability to strike a balance between the conflicting demands of stability and the emergence of new minimum-energy conformations beyond the training data, which is crucial for realistic exploration tasks in the field of biochemistry.
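The central trick mentioned in the abstract, attention without tensor products that still preserves equivariance, can be illustrated with a short sketch: if the attention weights are built from rotation-invariant quantities only (scalar features and distances), then linearly combining per-atom vector features with those weights is automatically equivariant. The layer below is a hypothetical toy, not the SO3krates architecture; names and feature shapes are assumptions, and the final assertion merely checks the equivariance property numerically.

```python
import numpy as np

def invariant_equivariant_attention(scalars, vectors, positions):
    # scalars: (N, F) invariant features, vectors: (N, 3, F) equivariant features,
    # positions: (N, 3). Attention logits use only rotation-invariant quantities,
    # so the weighted sum of vectors transforms correctly under rotations.
    dists = np.linalg.norm(positions[:, None] - positions[None, :], axis=-1)  # (N, N)
    logits = scalars @ scalars.T - dists                     # invariant logits
    w = np.exp(logits - logits.max(axis=-1, keepdims=True))
    w /= w.sum(axis=-1, keepdims=True)                       # (N, N) attention weights
    new_scalars = w @ scalars                                 # invariant update
    new_vectors = np.einsum('ij,jkf->ikf', w, vectors)        # equivariant update
    return new_scalars, new_vectors

# Numerical equivariance check with a random orthogonal matrix R:
# rotating the inputs rotates the vector outputs and leaves the scalars unchanged.
rng = np.random.default_rng(1)
N, F = 5, 4
s, v, x = rng.normal(size=(N, F)), rng.normal(size=(N, 3, F)), rng.normal(size=(N, 3))
R, _ = np.linalg.qr(rng.normal(size=(3, 3)))
s1, v1 = invariant_equivariant_attention(s, v, x)
s2, v2 = invariant_equivariant_attention(s, np.einsum('ab,nbf->naf', R, v), x @ R.T)
assert np.allclose(s1, s2) and np.allclose(np.einsum('ab,nbf->naf', R, v1), v2)
```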
Reconstructing Kernel-based Machine Learning Force Fields with Super-linear Convergence
Blücher, Stefan, Müller, Klaus-Robert, Chmiela, Stefan
Kernel machines have sustained continuous progress in the field of quantum chemistry. In particular, they have proven to be successful in the low-data regime of force field reconstruction. This is because many equivariances and invariances due to physical symmetries can be incorporated into the kernel function, compensating for the lack of larger datasets. So far, the scalability of kernel machines has, however, been hindered by their quadratic memory and cubic runtime complexity in the number of training points. While it is known that iterative Krylov subspace solvers can overcome these burdens, their convergence crucially relies on effective preconditioners, which are elusive in practice. Effective preconditioners need to partially pre-solve the learning problem in a computationally cheap and numerically robust manner. Here, we consider the broad class of Nyström-type methods to construct preconditioners based on successively more sophisticated low-rank approximations of the original kernel matrix, each of which provides a different set of computational trade-offs. All considered methods aim to identify a representative subset of inducing (kernel) columns to approximate the dominant kernel spectrum.
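As a concrete illustration of the kind of preconditioner the abstract describes, the sketch below builds a Nyström approximation from a random subset of kernel columns and applies its inverse via the Woodbury identity inside a preconditioned conjugate-gradient solve of the regularized kernel system (K + λI)α = y. It is a minimal example under stated assumptions (random column selection, an RBF kernel, SciPy's cg), not the paper's specific preconditioner constructions.

```python
import numpy as np
from scipy.sparse.linalg import cg, LinearOperator

def rbf_kernel(X, Y, gamma=0.5):
    d2 = ((X[:, None] - Y[None, :]) ** 2).sum(-1)
    return np.exp(-gamma * d2)

# Toy regression problem: solve (K + lam*I) alpha = y with preconditioned CG.
rng = np.random.default_rng(2)
n, m, lam = 400, 40, 1e-3
X = rng.normal(size=(n, 2))
y = np.sin(X[:, 0]) + 0.1 * rng.normal(size=n)
K = rbf_kernel(X, X)

# Nystrom preconditioner from m randomly chosen inducing columns.
idx = rng.choice(n, size=m, replace=False)
C = K[:, idx]                                   # (n, m) sampled kernel columns
W = K[np.ix_(idx, idx)]                         # (m, m) core block
inner = np.linalg.inv(lam * W + C.T @ C)        # (m, m) Woodbury inner inverse

def apply_preconditioner(r):
    # (C W^-1 C^T + lam*I)^-1 r via the Woodbury identity.
    return (r - C @ (inner @ (C.T @ r))) / lam

A = LinearOperator((n, n), matvec=lambda v: K @ v + lam * v)
M = LinearOperator((n, n), matvec=apply_preconditioner)
alpha, info = cg(A, y, M=M, maxiter=500)        # info == 0 signals convergence
print(info, np.linalg.norm(K @ alpha + lam * alpha - y))
```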
Machine Learning Force Fields
Unke, Oliver T., Chmiela, Stefan, Sauceda, Huziel E., Gastegger, Michael, Poltavsky, Igor, Schütt, Kristof T., Tkatchenko, Alexandre, Müller, Klaus-Robert
In recent years, the use of Machine Learning (ML) in computational chemistry has enabled numerous advances previously out of reach due to the computational complexity of traditional electronic-structure methods. One of the most promising applications is the construction of ML-based force fields (FFs), with the aim of narrowing the gap between the accuracy of ab initio methods and the efficiency of classical FFs. The key idea is to learn the statistical relation between chemical structure and potential energy without relying on a preconceived notion of fixed chemical bonds or knowledge about the relevant interactions. Such universal ML approximations are in principle only limited by the quality and quantity of the reference data used to train them. This review gives an overview of applications of ML-FFs and the chemical insights that can be obtained from them. The core concepts underlying ML-FFs are described in detail, and a step-by-step guide for constructing and testing them from scratch is given. The text concludes with a discussion of the challenges that remain to be overcome by the next generation of ML-FFs.
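One core concept the review builds on is that a conservative force field follows from a learned energy by differentiation, F = -∂E/∂r. The toy sketch below makes this explicit with an invented inverse-distance descriptor, a linear "energy model", and forces obtained as the negative (numerical) gradient of that energy; every name and modeling choice here is illustrative, not a prescription from the review.

```python
import numpy as np

def descriptor(positions):
    # Inverse pairwise distances: a toy stand-in for the structural
    # representations discussed in the review.
    diff = positions[:, None] - positions[None, :]
    d = np.linalg.norm(diff, axis=-1)
    iu = np.triu_indices(len(positions), k=1)
    return 1.0 / d[iu]

def energy(positions, weights):
    # Toy ML energy: a linear model on the descriptor.
    return descriptor(positions) @ weights

def forces(positions, weights, eps=1e-5):
    # Forces as the negative gradient of the learned energy (central differences),
    # which makes the force field energy-conserving by construction.
    f = np.zeros_like(positions)
    for i in range(positions.shape[0]):
        for a in range(3):
            dp = np.zeros_like(positions)
            dp[i, a] = eps
            f[i, a] = -(energy(positions + dp, weights)
                        - energy(positions - dp, weights)) / (2 * eps)
    return f

rng = np.random.default_rng(3)
pos = rng.normal(size=(4, 3))                    # 4 "atoms"
w = rng.normal(size=len(descriptor(pos)))
print(energy(pos, w), forces(pos, w))            # scalar energy, (4, 3) forces
```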
Ensemble Learning of Coarse-Grained Molecular Dynamics Force Fields with a Kernel Approach
Wang, Jiang, Chmiela, Stefan, Müller, Klaus-Robert, Noé, Frank, Clementi, Cecilia
Gradient-domain machine learning (GDML) is an accurate and efficient approach to learn a molecular potential and associated force field based on the kernel ridge regression algorithm. Here, we demonstrate its application to learn an effective coarse-grained (CG) model from all-atom simulation data in a sample-efficient manner. The coarse-grained force field is learned by following the thermodynamic consistency principle, here by minimizing the error between the predicted coarse-grained force and the all-atom mean force in the coarse-grained coordinates. Solving this problem by GDML directly is impossible because coarse-graining requires averaging over many training data points, resulting in impractical memory requirements for storing the kernel matrices. In this work, we propose a data-efficient and memory-saving alternative. Using ensemble learning and stratified sampling, we devise a two-layer training scheme that enables GDML to learn an effective coarse-grained model. We illustrate our method on a simple biomolecular system, alanine dipeptide, by reconstructing the free energy landscape of a coarse-grained variant of this molecule. Our novel GDML training scheme yields a smaller free energy error than neural networks when the training set is small, and a comparably high accuracy when the training set is sufficiently large.
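The thermodynamic consistency principle invoked above can be illustrated with a one-dimensional toy: regressing a coarse-grained force model against noisy instantaneous (projected all-atom) forces drives it toward the mean force, i.e. the negative gradient of the free energy along the CG coordinate. The sketch below uses an invented harmonic free-energy surface and a small radial-basis ridge regression; it is not the paper's GDML/ensemble scheme and not alanine dipeptide.

```python
import numpy as np

# Force matching along a single CG coordinate x: the "true" mean force is taken
# to be -2x (harmonic free energy), and the training targets are noisy
# instantaneous forces around it.
rng = np.random.default_rng(4)
x = rng.uniform(-2, 2, size=2000)                           # sampled CG coordinate
f_instant = -2.0 * x + rng.normal(scale=3.0, size=x.size)   # noisy projected forces

# Radial-basis-function ridge regression for the CG force model.
centers = np.linspace(-2, 2, 15)
Phi = np.exp(-(x[:, None] - centers[None, :]) ** 2)         # (n_samples, n_basis)
lam = 1e-2
w = np.linalg.solve(Phi.T @ Phi + lam * np.eye(len(centers)), Phi.T @ f_instant)

# Minimizing the MSE against instantaneous forces recovers the mean force.
x_test = np.linspace(-1.5, 1.5, 7)
f_cg = np.exp(-(x_test[:, None] - centers[None, :]) ** 2) @ w
print(np.c_[x_test, f_cg, -2.0 * x_test])                   # learned vs. true mean force
```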
SchNet: A continuous-filter convolutional neural network for modeling quantum interactions
Schütt, Kristof T., Kindermans, Pieter-Jan, Sauceda, Huziel E., Chmiela, Stefan, Tkatchenko, Alexandre, Müller, Klaus-Robert
Deep learning has the potential to revolutionize quantum chemistry as it is ideally suited to learn representations for structured data and speed up the exploration of chemical space. While convolutional neural networks have proven to be the first choice for images, audio and video data, the atoms in molecules are not restricted to a grid. Instead, their precise locations contain essential physical information that would be lost if discretized. Thus, we propose continuous-filter convolutional layers to model local correlations without requiring the data to lie on a grid. We apply those layers in SchNet: a novel deep learning architecture modeling quantum interactions in molecules. We obtain a joint model for the total energy and interatomic forces that follows fundamental quantum-chemical principles. This includes rotationally invariant energy predictions and a smooth, differentiable potential energy surface. Our architecture achieves state-of-the-art performance for benchmarks of equilibrium molecules and molecular dynamics trajectories. Finally, we introduce a more challenging benchmark with chemical and structural variations that suggests the path for further work.
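The continuous-filter convolution at the heart of SchNet can be sketched as follows: interatomic distances are expanded in radial basis functions, a small filter-generating network maps that expansion to a per-feature filter, and the filter gates the neighbor features before aggregation. The block below is a schematic numpy rendition with made-up weights, shapes, and cutoff; it is not the published SchNet implementation.

```python
import numpy as np

def continuous_filter_conv(features, positions, w1, w2, centers, gamma=10.0, cutoff=5.0):
    # features: (N, F), positions: (N, 3). A filter-generating network (w1, w2)
    # maps the Gaussian-expanded distances to per-feature filters that gate the
    # neighbor features before summation. Illustrative shapes and weights only.
    d = np.linalg.norm(positions[:, None] - positions[None, :], axis=-1)      # (N, N)
    rbf = np.exp(-gamma * (d[..., None] - centers) ** 2)                      # (N, N, B)
    filters = np.tanh(rbf @ w1) @ w2                                          # (N, N, F)
    mask = (d < cutoff) & ~np.eye(len(d), dtype=bool)                         # neighbors
    return ((filters * features[None, :, :]) * mask[..., None]).sum(axis=1)   # (N, F)

rng = np.random.default_rng(5)
N, F, B, H = 6, 8, 16, 32
feats, pos = rng.normal(size=(N, F)), rng.normal(size=(N, 3))
centers = np.linspace(0.0, 5.0, B)
w1 = rng.normal(size=(B, H)) / np.sqrt(B)
w2 = rng.normal(size=(H, F)) / np.sqrt(H)
out = continuous_filter_conv(feats, pos, w1, w2, centers)   # (N, F) updated atom features
```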