Multi-Modal Forecaster: Jointly Predicting Time Series and Textual Data

Kim, Kai, Tsai, Howard, Sen, Rajat, Das, Abhimanyu, Zhou, Zihao, Tanpure, Abhishek, Luo, Mathew, Yu, Rose

Nov-20-2024–arXiv.org Artificial Intelligence

Current forecasting approaches are largely unimodal and ignore the rich textual data that often accompany time series, owing to the lack of well-curated multimodal benchmark datasets. In this work, we develop the TimeText Corpus (TTC), a carefully curated, time-aligned text and time series dataset for multimodal forecasting. Our dataset is composed of sequences of numbers and text aligned to timestamps, and includes data from two different domains: climate science and healthcare. It is a significant addition to the small set of available multimodal datasets. We also propose the Hybrid Multi-Modal Forecaster (Hybrid-MMF), a multimodal LLM that jointly forecasts both text and time series data using shared embeddings. However, contrary to our expectations, our Hybrid-MMF model does not outperform existing baselines in our experiments. This negative result highlights the challenges inherent in multimodal forecasting.

Deep learning has become the predominant method for forecasting large-scale time series (Zhou et al., 2022; Wang et al., 2022; Woo et al., 2023), but most existing methods treat time series as a single data modality. In practice, time series data do not exist in isolation, and rich text metadata are often available.
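To make the idea of "sequences of numbers and text aligned to timestamps" concrete, here is a minimal Python sketch of how such time-aligned records might be cut into context and target windows for joint forecasting. It is an illustration under stated assumptions, not the TTC format or the Hybrid-MMF pipeline: the `Step` class, the `make_windows` helper, the window lengths, and the toy data are all hypothetical.

```python
from dataclasses import dataclass
from datetime import date, timedelta


@dataclass
class Step:
    """One timestamped observation: a numeric value and its paired text (hypothetical schema)."""
    day: date
    value: float
    text: str


def make_windows(steps, context_len, horizon):
    """Split a time-ordered sequence into (context, target) pairs.

    The target keeps both the numbers and the text of the future steps,
    so a model could be asked to forecast the two modalities jointly.
    """
    windows = []
    for start in range(len(steps) - context_len - horizon + 1):
        context = steps[start:start + context_len]
        target = steps[start + context_len:start + context_len + horizon]
        windows.append((context, target))
    return windows


# Hypothetical toy data, not taken from TTC.
start_day = date(2024, 1, 1)
steps = [
    Step(start_day + timedelta(days=i), 10.0 + i, f"report for day {i}")
    for i in range(6)
]

windows = make_windows(steps, context_len=3, horizon=2)
context, target = windows[0]
print([s.value for s in context], [s.text for s in target])
```

Each window pairs the past values and texts with the future values and texts for the same timestamps, which is the alignment a joint text and time series forecaster would rely on.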

forecasting, text2text, texttime2texttime, (13 more...)

Country:
- North America
  - Canada (0.14)
  - United States
    - District of Columbia > Washington (0.04)
    - Ohio (0.04)
    - Nevada (0.04)
    - Alaska (0.04)
    - Michigan > Washtenaw County
      - Ann Arbor (0.04)
    - California > San Diego County
      - San Diego (0.04)
  - Trinidad and Tobago > Trinidad
    - Arima > Arima (0.04)
- Europe > Spain
  - Catalonia > Barcelona Province > Barcelona (0.04)

Genre:
- Research Report > New Finding (0.66)

Industry:
- Health & Medicine (1.00)
- Government > Regional Government
  - North America Government > United States Government (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)