Predicting the Energy Landscape of Stochastic Dynamical System via Physics-informed Self-supervised Learning
Li, Ruikun, Wang, Huandong, Liao, Qingmin, Li, Yong
–arXiv.org Artificial Intelligence
Energy landscapes play a crucial role in shaping dynamics of many real-world complex systems. System evolution is often modeled as particles moving on a landscape under the combined effect of energy-driven drift and noise-induced diffusion, where the energy governs the long-term motion of the particles. Estimating the energy landscape of a system has been a longstanding interdisciplinary challenge, hindered by the high operational costs or the difficulty of obtaining supervisory signals. Therefore, the question of how to infer the energy landscape in the absence of true energy values is critical. In this paper, we propose a physics-informed self-supervised learning method to learn the energy landscape from the evolution trajectories of the system. Experimental results across interdisciplinary systems demonstrate that our estimated energy has a correlation coefficient above 0.9 with the ground truth, and evolution prediction accuracy exceeds the baseline by an average of 17.65%. Energy landscapes are inherent in many stochastic dynamical systems in nature, such as the potential energy surface of protein conformations (Norn et al., 2021), the fitness landscape of species evolution (Papkou et al., 2023; Poelwijk et al., 2007), and the fractal energy landscapes of soft glassy materials. The evolution of these systems can be modeled as particles moving on the landscape under the combined effect of energy-driven drift and noise-induced diffusion. When multiple low-energy regions exist in the landscape, the combined effect of the energy gradient and noise induces high-frequency movement within individual regions and low-frequency transitions between different regions (Lin et al., 2024). In this context, energy landscapes have been applied to guide the generation of stable molecular structures (No e et al., 2019) and direct the evolution of proteins (Packer & Liu, 2015; Greenbury et al., 2022), and more recently, they have been incorporated as physical knowledge into deep learning for predicting system evolution (Guan et al., 2024; Wang et al., 2024b; Ding et al., 2024). Couce et al. (2024) cultivate 50,000 generations of bacteria to measure the fitness effects of mutations, while Sarkisyan et al. (2016) measure tens of thousands of luminescent protein genotypic sequences to construct the functional landscape. These manual experimental approaches are not only costly but also heavily reliant on expert knowledge. With the success of deep learning in numerous disciplines (Jumper et al., 2021; Han et al., 2023; Wang et al., 2023; Chen et al., 2024), several deep learning models have been proposed to estimate energy or equivalent quantities based on molecular spatial structures (Zhang et al., 2018), species genotypes (Tonner et al., 2022), or population compositions (Skwara et al., 2023). These methods still require high-cost annotations to provide supervisory signals for energy, which limits their practicality.
arXiv.org Artificial Intelligence
Feb-23-2025