Wang, Dongdong
Robust Causal Learning for the Estimation of Average Treatment Effects
Huang, Yiyan, Leung, Cheuk Hang, Yan, Xing, Wu, Qi, Ma, Shumin, Yuan, Zhiri, Wang, Dongdong, Huang, Zhixiang
--Many practical decision-making problems in economics and healthcare seek to estimate the average treatment effect (A TE) from observational data. The Double/Debiased Machine Learning (DML) is one of the prevalent methods to estimate A TE in the observational study. However, the DML estimators can suffer an error-compounding issue and even give an extreme estimate when the propensity scores are misspecified or very close to 0 or 1. Previous studies have overcome this issue through some empirical tricks such as propensity score trimming, yet none of the existing literature solves this problem from a theoretical standpoint. In this paper, we propose a Robust Causal Learning (RCL) method to offset the deficiencies of the DML estimators. Theoretically, the RCL estimators i) are as consistent and doubly robust as the DML estimators, and ii) can get rid of the error-compounding issue. Empirically, the comprehensive experiments show that i) the RCL estimators give more stable estimations of the causal parameters than the DML estimators, and ii) the RCL estimators outperform the traditional estimators and their variants when applying different machine learning models on both simulation and benchmark datasets. Causal inference is ubiquitous for decision-making problems in various areas such as Healthcare [1]-[3] and Economics [4]-[6]. At the core of causal machine learning, estimating the average treatment effect (A TE) from observational data is challenging because some features (covariates) can influence both treatment and outcome in most practical circumstances. Co-first authors are in alphabetical order. Qi Wu is the corresponding author. Shumin Ma is also with Guangdong Provincial Key Laboratory of Interdisciplinary Research and Application for Data Science, BNU-HKBU United International College. To obtain a clean A TE, one can conduct the Randomized Controlled Trials (RCTs). RCTs are regarded as the gold standard to evaluate A TE, whereas conducting RCTs is often expensive and time-consuming.
Deep Epidemiological Modeling by Black-box Knowledge Distillation: An Accurate Deep Learning Model for COVID-19
Wang, Dongdong, Zhang, Shunpu, Wang, Liqiang
An accurate and efficient forecasting system is imperative to the prevention of emerging infectious diseases such as COVID-19 in public health. This system requires accurate transient modeling, lower computation cost, and fewer observation data. To tackle these three challenges, we propose a novel deep learning approach using black-box knowledge distillation for both accurate and efficient transmission dynamics prediction in a practical manner. First, we leverage mixture models to develop an accurate, comprehensive, yet impractical simulation system. Next, we use simulated observation sequences to query the simulation system to retrieve simulated projection sequences as knowledge. Then, with the obtained query data, sequence mixup is proposed to improve query efficiency, increase knowledge diversity, and boost distillation model accuracy. Finally, we train a student deep neural network with the retrieved and mixed observation-projection sequences for practical use. The case study on COVID-19 justifies that our approach accurately projects infections with much lower computation cost when observation data are limited.
The Causal Learning of Retail Delinquency
Huang, Yiyan, Leung, Cheuk Hang, Yan, Xing, Wu, Qi, Peng, Nanbo, Wang, Dongdong, Huang, Zhixiang
This paper focuses on the expected difference in borrower's repayment when there is a change in the lender's credit decisions. Classical estimators overlook the confounding effects and hence the estimation error can be magnificent. As such, we propose another approach to construct the estimators such that the error can be greatly reduced. The proposed estimators are shown to be unbiased, consistent, and robust through a combination of theoretical analysis and numerical testing. Moreover, we compare the power of estimating the causal quantities between the classical estimators and the proposed estimators. The comparison is tested across a wide range of models, including linear regression models, tree-based models, and neural network-based models, under different simulated datasets that exhibit different levels of causality, different degrees of nonlinearity, and different distributional properties. Most importantly, we apply our approaches to a large observational dataset provided by a global technology firm that operates in both the e-commerce and the lending business. We find that the relative reduction of estimation error is strikingly substantial if the causal effects are accounted for correctly.