Lu, Hua
Not All Data are Good Labels: On the Self-supervised Labeling for Time Series Forecasting
Yang, Yuxuan, Zhang, Dalin, Liang, Yuxuan, Lu, Hua, Chen, Gang, Li, Huan
Time Series Forecasting (TSF) is a crucial task in various domains, yet existing TSF models rely heavily on high-quality data and insufficiently exploit all available data. This paper explores a novel self-supervised approach to re-label time series datasets by inherently constructing candidate datasets. During the optimization of a simple reconstruction network, intermediates are used as pseudo labels in a self-supervised paradigm, improving generalization for any predictor. We introduce the Self-Correction with Adaptive Mask (SCAM), which discards overfitted components and selectively replaces them with pseudo labels generated from reconstructions. Additionally, we incorporate Spectral Norm Regularization (SNR) to further suppress overfitting from a loss landscape perspective. Our experiments on eleven real-world datasets demonstrate that SCAM consistently improves the performance of various backbone models. This work offers a new perspective on constructing datasets and enhancing the generalization of TSF models through self-supervised learning.
Missing Value Imputation for Multi-attribute Sensor Data Streams via Message Propagation (Extended Version)
Li, Xiao, Li, Huan, Lu, Hua, Jensen, Christian S., Pandey, Varun, Markl, Volker
Sensor data streams occur widely in various real-time applications in the context of the Internet of Things (IoT). However, sensor data streams feature missing values due to factors such as sensor failures, communication errors, or depleted batteries. Missing values can compromise the quality of real-time analytics tasks and downstream applications. Existing imputation methods either make strong assumptions about streams or have low efficiency. In this study, we aim to accurately and efficiently impute missing values in data streams that satisfy only general characteristics in order to benefit real-time applications more widely. First, we propose a message propagation imputation network (MPIN) that is able to recover the missing values of data instances in a time window. We give a theoretical analysis of why MPIN is effective. Second, we present a continuous imputation framework that consists of data update and model update mechanisms to enable MPIN to perform continuous imputation both effectively and efficiently. Extensive experiments on multiple real datasets show that MPIN can outperform the existing data imputers by wide margins and that the continuous imputation framework is efficient and accurate.
LightCTS: A Lightweight Framework for Correlated Time Series Forecasting
Lai, Zhichen, Zhang, Dalin, Li, Huan, Jensen, Christian S., Lu, Hua, Zhao, Yan
Correlated time series (CTS) forecasting plays an essential role in many practical applications, such as traffic management and server load control. Many deep learning models have been proposed to improve the accuracy of CTS forecasting. However, while models have become increasingly complex and computationally intensive, they struggle to improve accuracy. Pursuing a different direction, this study aims instead to enable much more efficient, lightweight models that preserve accuracy while being able to be deployed on resource-constrained devices. To achieve this goal, we characterize popular CTS forecasting models and yield two observations that indicate directions for lightweight CTS forecasting. On this basis, we propose the LightCTS framework that adopts plain stacking of temporal and spatial operators instead of alternate stacking that is much more computationally expensive. Moreover, LightCTS features light temporal and spatial operator modules, called L-TCN and GL-Former, that offer improved computational efficiency without compromising their feature extraction capabilities. LightCTS also encompasses a last-shot compression scheme to reduce redundant temporal features and speed up subsequent computations. Experiments with single-step and multi-step forecasting benchmark datasets show that LightCTS is capable of nearly state-of-the-art accuracy at much reduced computational and storage overheads.
PLATO-K: Internal and External Knowledge Enhanced Dialogue Generation
Bao, Siqi, He, Huang, Xu, Jun, Lu, Hua, Wang, Fan, Wu, Hua, Zhou, Han, Wu, Wenquan, Niu, Zheng-Yu, Wang, Haifeng
Recently, the practical deployment of open-domain dialogue systems has been plagued by the knowledge issue of information deficiency and factual inaccuracy. To this end, we introduce PLATO-K based on two-stage dialogic learning to strengthen internal knowledge memorization and external knowledge exploitation. In the first stage, PLATO-K learns through massive dialogue corpora and memorizes essential knowledge into model parameters. In the second stage, PLATO-K mimics human beings to search for external information and to leverage the knowledge in response generation. Extensive experiments reveal that the knowledge issue is alleviated significantly in PLATO-K with such comprehensive internal and external knowledge enhancement. Compared to the existing state-of-the-art Chinese dialogue model, the overall engagingness of PLATO-K is improved remarkably by 36.2% and 49.2% on chit-chat and knowledge-intensive conversations.
Comparing Alternative Route Planning Techniques: A Web-based Demonstration and User Study
Li, Lingxiao, Cheema, Muhammad Aamir, Lu, Hua, Ali, Mohammed Eunus, Toosi, Adel N.
Due to the popularity of smartphones, cheap wireless networks and availability of road network data, navigation applications have become a part of our everyday life. Many modern navigation systems and map-based services do not only provide the fastest route from a source location s to a target location t but also provide a few alternative routes to the users as more options to choose from. Consequently, computing alternative paths from a source s to a target t has received significant research attention in the past few years. However, it is not clear which of the existing approaches generates alternative paths of better quality because the quality of these alternatives is mostly subjective. Motivated by this, in this paper, we present the first user study that compares the quality of the alternative routes generated by four of the most popular existing approaches including the routes provided by Google Maps. We also present the details of a web-based demo system that can be accessed using any internet enabled device and allows users to see the alternative routes generated by the four approaches for any pair of source and target selected by the users. Our user study shows that although the mean rating received by Google Maps is slightly lower than the mean ratings received by the other three approaches, the results are not statistically significant. We also discuss the limitations of this user study and recommend the readers to interpret these results with caution because certain factors beyond our control may have affected the participants' ratings.