Self-Supervised Learning of Time Series Representation via Diffusion Process and Imputation-Interpolation-Forecasting Mask
Senane, Zineb, Cao, Lele, Buchner, Valentin Leonhard, Tashiro, Yusuke, You, Lei, Herman, Pawel, Nordahl, Mats, Tu, Ruibo, von Ehrenheim, Vilhelm
arXiv.org Artificial Intelligence
Time Series Representation Learning (TSRL) focuses on generating informative representations for various Time Series (TS) modeling tasks. Traditional Self-Supervised Learning (SSL) methods in TSRL fall into four main categories: reconstructive, adversarial, contrastive, and predictive, all of which share a sensitivity to noise and to intricate data nuances. Recently, diffusion-based methods have shown advanced generative capabilities. However, they primarily target specific application scenarios such as imputation and forecasting, leaving a gap in leveraging diffusion models for generic TSRL. Our work, Time Series Diffusion Embedding (TSDE), bridges this gap as the first diffusion-based SSL approach to TSRL. TSDE segments TS data into observed and masked parts using an Imputation-Interpolation-Forecasting (IIF) mask. It applies a trainable embedding function, featuring dual-orthogonal Transformer encoders with a crossover mechanism, to the observed part. We then train a reverse diffusion process, conditioned on these embeddings, to predict the noise added to the masked part. Extensive experiments demonstrate TSDE's superiority in imputation, interpolation, forecasting, anomaly detection, classification, and clustering. We also conduct an ablation study, present embedding visualizations, and compare inference speed, further substantiating TSDE's efficiency and validity in learning representations of TS data.
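The masking and noising steps described in the abstract can be illustrated with a minimal sketch. The abstract does not define the three mask shapes, so the versions here are assumptions inferred from their names: imputation hides random individual cells, interpolation hides whole time steps, and forecasting hides the tail of the window. The helper names `iif_mask` and `noisy_masked_part`, the `ratio` parameter, and the `alpha_bar` value are hypothetical. The noising step is the standard DDPM forward formula, not necessarily the exact schedule the authors use, and the embedding function and conditioned denoiser are not shown.

```python
import math
import random


def iif_mask(num_features, num_steps, strategy, ratio=0.2, rng=None):
    """Return a num_features x num_steps grid: 1 = observed, 0 = masked.

    Hypothetical helper; the three strategies are assumptions based on the
    names "imputation", "interpolation", and "forecasting".
    """
    rng = rng or random.Random()
    mask = [[1] * num_steps for _ in range(num_features)]
    if strategy == "imputation":
        # Hide individual (feature, step) cells chosen at random.
        for k in range(num_features):
            for l in range(num_steps):
                if rng.random() < ratio:
                    mask[k][l] = 0
    elif strategy == "interpolation":
        # Hide every feature at a few randomly chosen time steps.
        n_hidden = max(1, round(ratio * num_steps))
        for l in rng.sample(range(num_steps), n_hidden):
            for k in range(num_features):
                mask[k][l] = 0
    elif strategy == "forecasting":
        # Hide every feature from a cut point to the end of the window.
        cut = num_steps - max(1, round(ratio * num_steps))
        for k in range(num_features):
            for l in range(cut, num_steps):
                mask[k][l] = 0
    else:
        raise ValueError(f"unknown strategy: {strategy}")
    return mask


def noisy_masked_part(x, mask, alpha_bar, rng=None):
    """Apply the standard DDPM forward step to masked cells only.

    Returns (x_noisy, eps). eps holds the Gaussian noise added to each masked
    cell (the quantity a denoiser would be trained to predict) and 0.0 for
    observed cells, which are left untouched.
    """
    rng = rng or random.Random()
    a, b = math.sqrt(alpha_bar), math.sqrt(1.0 - alpha_bar)
    x_noisy, eps = [], []
    for row, mask_row in zip(x, mask):
        noisy_row, eps_row = [], []
        for value, observed in zip(row, mask_row):
            e = 0.0 if observed else rng.gauss(0.0, 1.0)
            noisy_row.append(value if observed else a * value + b * e)
            eps_row.append(e)
        x_noisy.append(noisy_row)
        eps.append(eps_row)
    return x_noisy, eps


# Usage: a toy series with 3 features and 10 time steps.
rng = random.Random(0)
series = [[float(l + k) for l in range(10)] for k in range(3)]
mask = iif_mask(3, 10, "forecasting", ratio=0.3, rng=rng)
noisy, eps = noisy_masked_part(series, mask, alpha_bar=0.5, rng=rng)
```

In this sketch the observed cells would feed the embedding function, while `eps` on the masked cells would serve as the regression target for the conditioned denoiser.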
Jun-17-2024
- Country:
- Asia
- Europe
- North America
- Canada (0.04)
- United States
- Alabama (0.04)
- Alaska > Anchorage Municipality
- Anchorage (0.04)
- California > San Francisco County
- San Francisco (0.04)
- New York > New York County
- New York City (0.04)
- Pacific Ocean > North Pacific Ocean
- San Francisco Bay (0.04)
- Genre:
- Research Report
- New Finding (0.46)
- Promising Solution (0.45)
- Industry:
- Health & Medicine (1.00)
- Technology:
- Information Technology
- Artificial Intelligence
- Machine Learning
- Inductive Learning (0.70)
- Neural Networks > Deep Learning (1.00)
- Performance Analysis > Accuracy (0.67)
- Statistical Learning (1.00)
- Natural Language > Large Language Model (0.67)
- Data Science > Data Mining (1.00)