Wu, Zhenxing
Token-Mol 1.0: Tokenized drug design with large language model
Wang, Jike, Qin, Rui, Wang, Mingyang, Fang, Meijing, Zhang, Yangyang, Zhu, Yuchen, Su, Qun, Gou, Qiaolin, Shen, Chao, Zhang, Odin, Wu, Zhenxing, Jiang, Dejun, Zhang, Xujun, Zhao, Huifeng, Wan, Xiaozhe, Wu, Zhourui, Liu, Liwei, Kang, Yu, Hsieh, Chang-Yu, Hou, Tingjun
Significant interests have recently risen in leveraging sequence-based large language models (LLMs) for drug design. However, most current applications of LLMs in drug discovery lack the ability to comprehend three-dimensional (3D) structures, thereby limiting their effectiveness in tasks that explicitly involve molecular conformations. In this study, we introduced Token-Mol, a token-only 3D drug design model. This model encodes all molecular information, including 2D and 3D structures, as well as molecular property data, into tokens, which transforms classification and regression tasks in drug discovery into probabilistic prediction problems, thereby enabling learning through a unified paradigm. Token-Mol is built on the transformer decoder architecture and trained using random causal masking techniques. Additionally, we proposed the Gaussian cross-entropy (GCE) loss function to overcome the challenges in regression tasks, significantly enhancing the capacity of LLMs to learn continuous numerical values. Through a combination of fine-tuning and reinforcement learning (RL), Token-Mol achieves performance comparable to or surpassing existing task-specific methods across various downstream tasks, including pocket-based molecular generation, conformation generation, and molecular property prediction. Compared to existing molecular pre-trained models, Token-Mol exhibits superior proficiency in handling a wider range of downstream tasks essential for drug design. Notably, our approach improves regression task accuracy by approximately 30% compared to similar token-only methods. Token-Mol overcomes the precision limitations of token-only models and has the potential to integrate seamlessly with general models such as ChatGPT, paving the way for the development of a universal artificial intelligence drug design model that facilitates rapid and high-quality drug design by experts.
Deep Geometry Handling and Fragment-wise Molecular 3D Graph Generation
Zhang, Odin, Huang, Yufei, Cheng, Shichen, Yu, Mengyao, Zhang, Xujun, Lin, Haitao, Zeng, Yundian, Wang, Mingyang, Wu, Zhenxing, Zhao, Huifeng, Zhang, Zaixi, Hua, Chenqing, Kang, Yu, Cui, Sunliang, Pan, Peichen, Hsieh, Chang-Yu, Hou, Tingjun
Most earlier 3D structure-based molecular generation approaches follow an atom-wise paradigm, incrementally adding atoms to a partially built molecular fragment within protein pockets. These methods, while effective in designing tightly bound ligands, often overlook other essential properties such as synthesizability. The fragment-wise generation paradigm offers a promising solution. However, a common challenge across both atom-wise and fragment-wise methods lies in their limited ability to co-design plausible chemical and geometrical structures, resulting in distorted conformations. In response to this challenge, we introduce the Deep Geometry Handling protocol, a more abstract design that extends the design focus beyond the model architecture. Through a comprehensive review of existing geometry-related models and their protocols, we propose a novel hybrid strategy, culminating in the development of FragGen - a geometry-reliable, fragment-wise molecular generation method. FragGen marks a significant leap forward in the quality of generated geometry and the synthesis accessibility of molecules. The efficacy of FragGen is further validated by its successful application in designing type II kinase inhibitors at the nanomolar level.