Goto

Collaborating Authors

 initial molecule


41da609c519d77b29be442f8c1105647-Supplemental.pdf

Neural Information Processing Systems

A.1 Additional experimental results We further introduce our additional experiments in this section. In our main article, we compared our model FREED with baseline models REINVENT and MORLD. For fairer comparison of quality scores, we also performed multi-objective optimization of REINVENT and MORLD on both quality score (pharmacochemical filter score) and docking score as follows. Table 1 in the main text shows that such an implicit method is not enough to achieve nearly perfect filter scores as our model did. Also, as shown in Table 1 REINVENT showed deteriorated performance when jointly trained with filter scores, in terms of hit ratio and top 5% scores, implying that multiobjective optimization is more difficult than explicitly constrained optimization. Such a result was consistent for all three targets. The two baseline models REINVENT and MORLD that are jointly trained to maximize filter scores are noted as REINVENT w/ filter and MORLD w/ filter.


Distributed Reinforcement Learning for Molecular Design: Antioxidant case

arXiv.org Artificial Intelligence

Deep reinforcement learning has successfully been applied for molecular discovery as shown by the Molecule Deep Q-network (MolDQN) algorithm. This algorithm has challenges when applied to optimizing new molecules: training such a model is limited in terms of scalability to larger datasets and the trained model cannot be generalized to different molecules in the same dataset. In this paper, a distributed reinforcement learning algorithm for antioxidants, called DA-MolDQN is proposed to address these problems. State-of-the-art bond dissociation energy (BDE) and ionization potential (IP) predictors are integrated into DA-MolDQN, which are critical chemical properties while optimizing antioxidants. Training time is reduced by algorithmic improvements for molecular modifications. The algorithm is distributed, scalable for up to 512 molecules, and generalizes the model to a diverse set of molecules. The proposed models are trained with a proprietary antioxidant dataset. The results have been reproduced with both proprietary and public datasets. The proposed molecules have been validated with DFT simulations and a subset of them confirmed in public "unseen" datasets. In summary, DA-MolDQN is up to 100x faster than previous algorithms and can discover new optimized molecules from proprietary and public antioxidants.