Cheng, Lin
Adviser-Actor-Critic: Eliminating Steady-State Error in Reinforcement Learning Control
Chen, Donghe, Peng, Yubin, Zheng, Tengjie, Wang, Han, Qu, Chaoran, Cheng, Lin
High-precision control tasks present substantial Dynamic modeling is crucial for understanding robot behavior challenges for reinforcement learning (RL) algorithms, and designing control strategies. However, real-world frequently resulting in suboptimal performance systems often display nonlinear behavior, making it difficult attributed to network approximation inaccuracies to create accurate models. Additionally, the highdimensional and inadequate sample quality.These state space of robots can lead to complex interactions issues are exacerbated when the task requires the between components, further complicating control agent to achieve a precise goal state, as is common (Buşoniu et al., 2018; Zhao et al., 2020a;b; Cao et al., 2023). in robotics and other real-world applications.We To highlight these challenges, we discuss the attributes and introduce Adviser-Actor-Critic (AAC), designed limitations of existing control algorithms.
Error Distribution Smoothing:Advancing Low-Dimensional Imbalanced Regression
Chen, Donghe, Yue, Jiaxuan, Zheng, Tengjie, Wang, Lanxuan, Cheng, Lin
In real-world regression tasks, datasets frequently exhibit imbalanced distributions, characterized by a scarcity of data in high-complexity regions and an abundance in low-complexity areas. This imbalance presents significant challenges for existing classification methods with clear class boundaries, while highlighting a scarcity of approaches specifically designed for imbalanced regression problems. To better address these issues, we introduce a novel concept of Imbalanced Regression, which takes into account both the complexity of the problem and the density of data points, extending beyond traditional definitions that focus only on data density. Furthermore, we propose Error Distribution Smoothing (EDS) as a solution to tackle imbalanced regression, effectively selecting a representative subset from the dataset to reduce redundancy while maintaining balance and representativeness. Through several experiments, EDS has shown its effectiveness, and the related code and dataset can be accessed at https://anonymous.4open.science/r/Error-Distribution-Smoothing-762F.
Recursive Gaussian Process State Space Model
Zheng, Tengjie, Cheng, Lin, Gong, Shengping, Huang, Xu
Learning dynamical models from data is not only fundamental but also holds great promise for advancing principle discovery, time-series prediction, and controller design. Among various approaches, Gaussian Process State-Space Models (GPSSMs) have recently gained significant attention due to their combination of flexibility and interpretability. However, for online learning, the field lacks an efficient method suitable for scenarios where prior information regarding data distribution and model function is limited. To address this issue, this paper proposes a recursive GPSSM method with adaptive capabilities for both operating domains and Gaussian process (GP) hyperparameters. Specifically, we first utilize first-order linearization to derive a Bayesian update equation for the joint distribution between the system state and the GP model, enabling closed-form and domain-independent learning. Second, an online selection algorithm for inducing points is developed based on informative criteria to achieve lightweight learning. Third, to support online hyperparameter optimization, we recover historical measurement information from the current filtering distribution. Comprehensive evaluations on both synthetic and real-world datasets demonstrate the superior accuracy, computational efficiency, and adaptability of our method compared to state-of-the-art online GPSSM techniques.
Graph Attention-Based Symmetry Constraint Extraction for Analog Circuits
Xu, Qi, Wang, Lijie, Wang, Jing, Chen, Song, Cheng, Lin, Kang, Yi
In recent years, analog circuits have received extensive attention and are widely used in many emerging applications. The high demand for analog circuits necessitates shorter circuit design cycles. To achieve the desired performance and specifications, various geometrical symmetry constraints must be carefully considered during the analog layout process. However, the manual labeling of these constraints by experienced analog engineers is a laborious and time-consuming process. To handle the costly runtime issue, we propose a graph-based learning framework to automatically extract symmetric constraints in analog circuit layout. The proposed framework leverages the connection characteristics of circuits and the devices'information to learn the general rules of symmetric constraints, which effectively facilitates the extraction of device-level constraints on circuit netlists. The experimental results demonstrate that compared to state-of-the-art symmetric constraint detection approaches, our framework achieves higher accuracy and lower false positive rate.
Intelligent diagnostic scheme for lung cancer screening with Raman spectra data by tensor network machine learning
An, Yu-Jia, Bai, Sheng-Chen, Cheng, Lin, Li, Xiao-Guang, Wang, Cheng-en, Han, Xiao-Dong, Su, Gang, Ran, Shi-Ju, Wang, Cong
Artificial intelligence (AI) has brought tremendous impacts on biomedical sciences from academic researches to clinical applications, such as in biomarkers' detection and diagnosis, optimization of treatment, and identification of new therapeutic targets in drug discovery. However, the contemporary AI technologies, particularly deep machine learning (ML), severely suffer from non-interpretability, which might uncontrollably lead to incorrect predictions. Interpretability is particularly crucial to ML for clinical diagnosis as the consumers must gain necessary sense of security and trust from firm grounds or convincing interpretations. In this work, we propose a tensor-network (TN)-ML method to reliably predict lung cancer patients and their stages via screening Raman spectra data of Volatile organic compounds (VOCs) in exhaled breath, which are generally suitable as biomarkers and are considered to be an ideal way for non-invasive lung cancer screening. The prediction of TN-ML is based on the mutual distances of the breath samples mapped to the quantum Hilbert space. Thanks to the quantum probabilistic interpretation, the certainty of the predictions can be quantitatively characterized. The accuracy of the samples with high certainty is almost 100$\%$. The incorrectly-classified samples exhibit obviously lower certainty, and thus can be decipherably identified as anomalies, which will be handled by human experts to guarantee high reliability. Our work sheds light on shifting the ``AI for biomedical sciences'' from the conventional non-interpretable ML schemes to the interpretable human-ML interactive approaches, for the purpose of high accuracy and reliability.
Reinforcement Learning for Standards Design
Kasi, Shahrukh Khan, Mukherjee, Sayandev, Cheng, Lin, Huberman, Bernardo A.
Communications standards are designed via committees of humans holding repeated meetings over months or even years until consensus is achieved. This includes decisions regarding the modulation and coding schemes to be supported over an air interface. We propose a way to "automate" the selection of the set of modulation and coding schemes to be supported over a given air interface and thereby streamline both the standards design process and the ease of extending the standard to support new modulation schemes applicable to new higher-level applications and services. Our scheme involves machine learning, whereby a constructor entity submits proposals to an evaluator entity, which returns a score for the proposal. The constructor employs reinforcement learning to iterate on its submitted proposals until a score is achieved that was previously agreed upon by both constructor and evaluator to be indicative of satisfying the required design criteria (including performance metrics for transmissions over the interface).