Su, Gang
Transformer-based Drum-level Prediction in a Boiler Plant with Delayed Relations among Multivariates
Su, Gang, Yang, Sun, Li, Zhishuai
Abstract--The steam drum water level is a critical parameter that directly impacts the safety and efficiency of power plant operations. There is usually physical information behind the complex interrelations: for I. Proper control of drum level is essential for B's outlet are shared via a same parent pipe; thus, their ensuring continuous steam production while adhering to safety outlet pressures are always in the same trend. However, achieving precise control of drum level to understand all the variables causal interrelation is critical to is a challenging task due to various process disturbances and predict the final drum level. Traditionally, control Moreover, given the long reaction chain, for example, strategies for drum level regulation have relied on feedback increasing pump A's inlet flow will take a long delay to be and feedforward control techniques, often employing Proportional reflected in the increasing drum level (usually the delay will Integral Derivative (PID [1]) controllers in conjunction be around 100 seconds and it varies to different variables and with rule-based feedforward controllers. While these strategies different boiler plant), so when predicting, pump A's inlet flow have been effective to some extent, they often struggle to at time step t should be used to predict the drum level at time adapt to changing operating conditions and fail to capture the step t + t In recent years, there has been a growing interest in In this study, we aim to develop a data-driven model leveraging advanced deep-learning techniques to predict the based on Transformer architecture for predicting drum level future drum level, which enhances the PID via feedforward variations in steam boilers.
Tensor networks for interpretable and efficient quantum-inspired machine learning
Ran, Shi-Ju, Su, Gang
It is a critical challenge to simultaneously gain high interpretability and efficiency with the current schemes of deep machine learning (ML). Tensor network (TN), which is a well-established mathematical tool originating from quantum mechanics, has shown its unique advantages on developing efficient ``white-box'' ML schemes. Here, we give a brief review on the inspiring progresses made in TN-based ML. On one hand, interpretability of TN ML is accommodated with the solid theoretical foundation based on quantum information and many-body physics. On the other hand, high efficiency can be rendered from the powerful TN representations and the advanced computational techniques developed in quantum many-body physics. With the fast development on quantum computers, TN is expected to conceive novel schemes runnable on quantum hardware, heading towards the ``quantum artificial intelligence'' in the forthcoming future.
Intelligent diagnostic scheme for lung cancer screening with Raman spectra data by tensor network machine learning
An, Yu-Jia, Bai, Sheng-Chen, Cheng, Lin, Li, Xiao-Guang, Wang, Cheng-en, Han, Xiao-Dong, Su, Gang, Ran, Shi-Ju, Wang, Cong
Artificial intelligence (AI) has brought tremendous impacts on biomedical sciences from academic researches to clinical applications, such as in biomarkers' detection and diagnosis, optimization of treatment, and identification of new therapeutic targets in drug discovery. However, the contemporary AI technologies, particularly deep machine learning (ML), severely suffer from non-interpretability, which might uncontrollably lead to incorrect predictions. Interpretability is particularly crucial to ML for clinical diagnosis as the consumers must gain necessary sense of security and trust from firm grounds or convincing interpretations. In this work, we propose a tensor-network (TN)-ML method to reliably predict lung cancer patients and their stages via screening Raman spectra data of Volatile organic compounds (VOCs) in exhaled breath, which are generally suitable as biomarkers and are considered to be an ideal way for non-invasive lung cancer screening. The prediction of TN-ML is based on the mutual distances of the breath samples mapped to the quantum Hilbert space. Thanks to the quantum probabilistic interpretation, the certainty of the predictions can be quantitatively characterized. The accuracy of the samples with high certainty is almost 100$\%$. The incorrectly-classified samples exhibit obviously lower certainty, and thus can be decipherably identified as anomalies, which will be handled by human experts to guarantee high reliability. Our work sheds light on shifting the ``AI for biomedical sciences'' from the conventional non-interpretable ML schemes to the interpretable human-ML interactive approaches, for the purpose of high accuracy and reliability.
Quantum Compressed Sensing with Unsupervised Tensor Network Machine Learning
Ran, Shi-Ju, Sun, Zheng-Zhi, Fei, Shao-Ming, Su, Gang, Lewenstein, Maciej
We propose tensor-network compressed sensing (TNCS) for compressing and communicating classical information via the quantum states trained by the unsupervised tensor network (TN) machine learning. The main task of TNCS is to reconstruct as accurately as possible the full classical information from a generative TN state, by knowing as small part of the classical information as possible. In the applications to the datasets of hand-written digits and fashion images, we train the generative TN (matrix product state) by the training set, and show that the images in the testing set can be reconstructed from a small number of pixels. Related issues including the applications of TNCS to quantum encrypted communication are discussed.
Generative Tensor Network Classification Model for Supervised Machine Learning
Sun, Zheng-Zhi, Peng, Cheng, Liu, Ding, Ran, Shi-Ju, Su, Gang
Tensor network (TN) has recently triggered extensive interests in developing machine-learning models in quantum many-body Hilbert space. Here we purpose a generative TN classification (GTNC) approach for supervised learning. The strategy is to train the generative TN for each class of the samples to construct the classifiers. The classification is implemented by comparing the distance in the many-body Hilbert space. The numerical experiments by GTNC show impressive performance on the MNIST and Fashion-MNIST dataset. The testing accuracy is competitive to the state-of-the-art convolutional neural network while higher than the naive Bayes classifier (a generative classifier) and support vector machine. Moreover, GTNC is more efficient than the existing TN models that are in general discriminative. By investigating the distances in the many-body Hilbert space, we find that (a) the samples are naturally clustering in such a space; and (b) bounding the bond dimensions of the TN's to finite values corresponds to removing redundant information in the image recognition. These two characters make GTNC an adaptive and universal model of excellent performance.
Machine Learning by Two-Dimensional Hierarchical Tensor Networks: A Quantum Information Theoretic Perspective on Deep Architectures
Liu, Ding, Ran, Shi-Ju, Wittek, Peter, Peng, Cheng, García, Raul Blázquez, Su, Gang, Lewenstein, Maciej
The resemblance between the methods used in studying quantum-many body physics and in machine learning has drawn considerable attention. In particular, tensor networks (TNs) and deep learning architectures bear striking similarities to the extent that TNs can be used for machine learning. Previous results used one-dimensional TNs in image recognition, showing limited scalability and a high bond dimension. In this work, we train two-dimensional hierarchical TNs to solve image recognition problems, using a training algorithm derived from the multipartite entanglement renormalization ansatz (MERA). This approach overcomes scalability issues and implies novel mathematical connections among quantum many-body physics, quantum information theory, and machine learning. While keeping the TN unitary in the training phase, TN states can be defined, which optimally encodes each class of the images into a quantum many-body state. We study the quantum features of the TN states, including quantum entanglement and fidelity. We suggest these quantities could be novel properties that characterize the image classes, as well as the machine learning tasks. Our work could be further applied to identifying possible quantum properties of certain artificial intelligence methods.