quantum chemical calculation
Diffusion-based Generative AI for Exploring Transition States from 2D Molecular Graphs
Kim, Seonghwan, Woo, Jeheon, Kim, Woo Youn
The exploration of transition state (TS) geometries is crucial for elucidating chemical reaction mechanisms and modeling their kinetics. Recently, machine learning (ML) models have shown remarkable performance for prediction of TS geometries. However, they require 3D conformations of reactants and products often with their appropriate orientations as input, which demands substantial efforts and computational cost. Here, we propose a generative approach based on the stochastic diffusion method, namely TSDiff, for prediction of TS geometries just from 2D molecular graphs. TSDiff outperformed the existing ML models with 3D geometries in terms of both accuracy and efficiency. Moreover, it enables to sample various TS conformations, because it learned the distribution of TS geometries for diverse reactions in training. Thus, TSDiff was able to find more favorable reaction pathways with lower barrier heights than those in the reference database. These results demonstrate that TSDiff shows promising potential for an efficient and reliable TS exploration.
PubChemQC B3LYP/6-31G*//PM6 dataset: the Electronic Structures of 86 Million Molecules using B3LYP/6-31G* calculations
Nakata, Maho, Maeda, Toshiyuki
This article presents the "PubChemQC B3LYP/6-31G*//PM6" dataset, containing electronic properties of 85,938,443 molecules. It includes orbitals, orbital energies, total energies, dipole moments, and other relevant properties. The dataset encompasses a wide range of molecules, from essential compounds to biomolecules up to 1000 molecular weight, covering 94.0% of the original PubChem Compound catalog (as of August 29, 2016). The electronic properties were calculated using the B3LYP/6-31G* and PM6 methods. The dataset is available in three formats: (i) GAMESS quantum chemistry program files, (ii) selected JSON output files, and (iii) a PostgreSQL database, enabling researchers to query molecular properties. Five sub-datasets offer more specific data. The first two subsets include molecules with C, H, O, and N, under 300 and 500 molecular weight respectively. The third and fourth subsets contain C, H, N, O, P, S, F, and Cl, under 300 and 500 molecular weight respectively. The fifth subset includes C, H, N, O, P, S, F, Cl, Na, K, Mg, and Ca, under 500 molecular weight. Coefficients of determination ranged from 0.892 (CHON500) to 0.803 (whole) for the HOMO-LUMO energy gap. These findings represent extensive investigations and can be utilized for drug discovery, material science, and other applications. The datasets are available under the Creative Commons Attribution 4.0 International license at https://nakatamaho.riken.jp/pubchemqc.riken.jp/b3lyp_pm6_datasets.html.
Deep learning to perform quantum chemical calculations.
If we want to calculate the physical properties of matter (such as the electronic state), we need to describe the state of the electron. The equations of motion that we are familiar with cannot describe the states of small objects such as electrons, so we need to use something called quantum mechanics. In quantum mechanics, the state of an electron is described by a complex function, the "wave function". "wave function" is, roughly speaking, electrons' orbitals. The equation below, called the Schrödinger equation, is a basic equation in quantum mechanics that shows the relationship between the wave function and the energy (Hamiltonian, H-hat on the right side), where ψ is the wave function.
Machine learning in quantum chemistry - using machine learning to improve the results of quantum chemical calculations at University of Sheffield on FindAPhD.com
This is a self-funded project. The applicant should have or expect to gain at least an upper second class degree, or equivalent overseas qualification, in a relevant subject. If you have the correct qualifications and access to your own funding, either from your home country or your own finances, your application to work with this supervisor will be considered.
Machine learning enables long time scale molecular photodynamics simulations
Westermayr, Julia, Gastegger, Michael, Menger, Maximilian F. S. J., Mai, Sebastian, González, Leticia, Marquetand, Philipp
Abstract: Photo-inducedprocesses are fundamental in nature, but accurate simulations are seriously limited by the cost of the underlying quantum chemical calculations, hampering their application for long time scales. Here we introduce a method based on machine learning to overcome this bottleneck and enable accurate photodynamics on nanosecond time scales, which are otherwise out of reach with contemporary approaches. Instead of expensive quantum chemistry during molecular dynamics simulations, we use deep neural networks to learn the relationship between a molecular geometry and its high-dimensional electronic properties. As an example, the time evolution of the methylenimmonium cation for one nanosecond is used to demonstrate that machine learning algorithms can outperform standard excited-state molecular dynamics approaches in their computational efficiency while delivering the same accuracy. Introduction Machine learning (ML) is revolutionizing the most diverse domains, like image recognition [1], playing board games [2], or society integration of refugees [3]. Also in chemistry, anincreasing range of applications is being tackled with ML, for example, the design and discovery of new molecules and materials [4, 5, 6]. In the present study, we show how ML enables efficient photodynamics simulations. Photodynamics is the study of photo-induced processes that occur after a molecule is exposed to light. Photosynthesis or DNA photodamage leading to skin cancer are only two examples of phenomena that involve molecules interacting with light [7, 8, 9, 10, 11]. The simulation of such processes has been key to learn structure-dynamicsfunction relationshipsthat can be used to guide the design of photonic materials, such as photosensitive drugs [12], photocatalysts [4] and photovoltaics [13, 14].