Vision-Enhanced Time Series Forecasting via Latent Diffusion Models

Ruan, Weilin, Zhong, Siru, Wen, Haomin, Liang, Yuxuan

Feb-16-2025–arXiv.org Artificial Intelligence

Diffusion models have recently emerged as powerful frameworks for generating high-quality images. While recent studies have explored their application to time series forecasting, these approaches face significant challenges in cross-modal modeling and transforming visual information effectively to capture temporal patterns. In this paper, we propose LDM4TS, a novel framework that leverages the powerful image reconstruction capabilities of latent diffusion models for vision-enhanced time series forecasting. Instead of introducing external visual data, we are the first to use complementary transformation techniques to convert time series into multi-view visual representations, allowing the model to exploit the rich feature extraction capabilities of the pre-trained vision encoder. Subsequently, these representations are reconstructed using a latent diffusion model with a cross-modal conditioning mechanism as well as a fusion module. Experimental results demonstrate that LDM4TS outperforms various specialized forecasting models for time series forecasting tasks.

diffusion model, forecasting, representation, (12 more...)

arXiv.org Artificial Intelligence

Feb-16-2025

arXiv.org PDF

Add feedback

Country:
- North America > United States (0.14)
- Europe > Italy
  - Calabria > Catanzaro Province > Catanzaro (0.04)
- Asia > China
  - Hong Kong (0.04)
  - Guangdong Province > Guangzhou (0.04)

Genre:
- Research Report > New Finding (0.48)

Industry:
- Energy (0.46)

Technology:
- Information Technology
  - Data Science > Data Mining (1.00)
  - Artificial Intelligence
    - Natural Language > Large Language Model (0.68)
    - Machine Learning
      - Neural Networks > Deep Learning (1.00)
      - Statistical Learning (0.68)
      - Pattern Recognition (0.67)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found