Tang, Yu
Physics-informed Machine Learning for Calibrating Macroscopic Traffic Flow Models
Tang, Yu, Jin, Li, Ozbay, Kaan
Well-calibrated traffic flow models are fundamental to understanding traffic phenomena and designing control strategies. Traditional calibration has been developed base on optimization methods. In this paper, we propose a novel physics-informed, learning-based calibration approach that achieves performances comparable to and even better than those of optimization-based methods. To this end, we combine the classical deep autoencoder, an unsupervised machine learning model consisting of one encoder and one decoder, with traffic flow models. Our approach informs the decoder of the physical traffic flow models and thus induces the encoder to yield reasonable traffic parameters given flow and speed measurements. We also introduce the denoising autoencoder into our method so that it can handles not only with normal data but also with corrupted data with missing values. We verified our approach with a case study of I-210 E in California.
Geometric sliding mode control of mechanical systems on Lie groups
Espindola, Eduardo, Tang, Yu
This paper presents a generalization of conventional sliding mode control designs for systems in Euclidean spaces to fully actuated simple mechanical systems whose configuration space is a Lie group for the trajectory-tracking problem. A generic kinematic control is first devised in the underlying Lie algebra, which enables the construction of a Lie group on the tangent bundle where the system state evolves. A sliding subgroup is then proposed on the tangent bundle with the desired sliding properties, and a control law is designed for the error dynamics trajectories to reach the sliding subgroup globally exponentially. Tracking control is then composed of the reaching law and sliding mode, and is applied for attitude tracking on the special orthogonal group SO(3) and the unit sphere S3. Numerical simulations show the performance of the proposed geometric sliding-mode controller (GSMC) in contrast with two control schemes of the literature.
A Deep Reinforcement Learning Approach for Ramp Metering Based on Traffic Video Data
Liu, Bing, Tang, Yu, Ji, Yuxiong, Shen, Yu, Du, Yuchuan
Ramp metering that uses traffic signals to regulate vehicle flows from the on-ramps has been widely implemented to improve vehicle mobility of the freeway. Previous studies generally update signal timings in real-time based on predefined traffic measures collected by point detectors, such as traffic volumes and occupancies. Comparing with point detectors, traffic cameras-which have been increasingly deployed on road networks-could cover larger areas and provide more detailed traffic information. In this work, we propose a deep reinforcement learning (DRL) method to explore the potential of traffic video data in improving the efficiency of ramp metering. The proposed method uses traffic video frames as inputs and learns the optimal control strategies directly from the high-dimensional visual inputs. A real-world case study demonstrates that, in comparison with a state-of-the-practice method, the proposed DRL method results in 1) lower travel times in the mainline, 2) shorter vehicle queues at the on-ramp, and 3) higher traffic flows downstream of the merging area. The results suggest that the proposed method is able to extract useful information from the video data for better ramp metering controls.
ADMMiRNN: Training RNN with Stable Convergence via An Efficient ADMM Approach
Tang, Yu, Kan, Zhigang, Sun, Dequan, Qiao, Linbo, Xiao, Jingjing, Lai, Zhiquan, Li, Dongsheng
It is hard to train Recurrent Neural Network (RNN) with stable convergence and avoid gradient vanishing and exploding, as the weights in the recurrent unit are repeated from iteration to iteration. Moreover, RNN is sensitive to the initialization of weights and bias, which brings difficulty in the training phase. With the gradient-free feature and immunity to poor conditions, the Alternating Direction Method of Multipliers (ADMM) has become a promising algorithm to train neural networks beyond traditional stochastic gradient algorithms. However, ADMM could not be applied to train RNN directly since the state in the recurrent unit is repetitively updated over timesteps. Therefore, this work builds a new framework named ADMMiRNN upon the unfolded form of RNN to address the above challenges simultaneously and provides novel update rules and theoretical convergence analysis. We explicitly specify key update rules in the iterations of ADMMiRNN with deliberately constructed approximation techniques and solutions to each subproblem instead of vanilla ADMM. Numerical experiments are conducted on MNIST and text classification tasks, where ADMMiRNN achieves convergent results and outperforms compared baselines. Furthermore, ADMMiRNN trains RNN in a more stable way without gradient vanishing or exploding compared to the stochastic gradient algorithms. Source code has been available at https://github.com/TonyTangYu/ADMMiRNN.
Median regression with differential privacy
Chen, E, Miao, Ying, Tang, Yu
Personal privacy information may be exposed with the unprecedented availability of datasets, so there is increasing requirement that statistical analysis of such datasets should protect individual privacy. As [6] describes, differential privacy addresses the paradox of learning nothing about an individual while learning useful information about a population. Over the past few years, differential privacy has been investigated in machine learning [1] and has been applied in the real world, see for example [8]. Recently, [3] formulates a general lower bound argument for minimax risks with differential privacy constraints, and applies this argument to high-dimensional mean estimation and linear regression problems. In this paper, three privacy preserving methods are proposed for median regression, which is a special case of quantile regression. Quantile regression was first introduced in [12], which aims to estimate and conduct inference about conditional quantile functions. In recent years, quantile regression has become a comprehensive method for statistical analysis of response models and it has been widely used in reality, such as survival analysis and economics, see for example, [14], [20] and [15]. The fact that the median regression takes least absolute deviation as its objective function to estimate parameters has been known among statisticians [12].