Synthetic Series-Symbol Data Generation for Time Series Foundation Models
Wang, Wenxuan, Wu, Kai, Li, Yujian Betterest, Wang, Dan, Zhang, Xiaoyu
arXiv.org Artificial Intelligence
Foundation models for time series analysis (TSA) have attracted significant attention. However, challenges such as training data scarcity and imbalance continue to hinder their development. Inspired by complex dynamic system theories, we design a series-symbol data generation mechanism, enabling the unrestricted creation of high-quality time series data paired with corresponding symbolic expressions. To leverage series-symbol data pairs with strong correlations, we develop SymTime, a pre-trained foundation model for enhancing time series representation using symbolic information. SymTime demonstrates competitive performance across five major TSA tasks when fine-tuned on downstream tasks, rivaling foundation models pre-trained on real-world datasets. This approach underscores the potential of series-symbol data generation and pre-training mechanisms in overcoming data scarcity and enhancing task performance. The code is available at https://github.com/wwhenxuan/SymTime.
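To make the idea of a series-symbol pair concrete, the following is a minimal sketch in Python. It is not the authors' generator: the operator set, the tree depth, the time grid, and the names `random_expr`, `to_string`, `evaluate`, and `make_pair` are all assumptions chosen for illustration. It samples a random symbolic expression in one variable `t` and evaluates it on a grid, so that each generated series comes with the expression that produced it.

```python
import math
import random

# Hypothetical operator vocabulary; the abstract does not specify one.
UNARY = {"sin": math.sin, "cos": math.cos, "tanh": math.tanh}
BINARY = {"+": lambda a, b: a + b, "*": lambda a, b: a * b}


def random_expr(rng, depth):
    """Sample an expression tree (nested tuples) over the variable t."""
    if depth == 0 or rng.random() < 0.2:
        # Leaf: the time variable or a small constant.
        if rng.random() < 0.7:
            return ("t",)
        return ("const", round(rng.uniform(-2.0, 2.0), 2))
    if rng.random() < 0.5:
        return (rng.choice(sorted(UNARY)), random_expr(rng, depth - 1))
    return (
        rng.choice(sorted(BINARY)),
        random_expr(rng, depth - 1),
        random_expr(rng, depth - 1),
    )


def to_string(node):
    """Render the tree as a readable symbolic expression."""
    head = node[0]
    if head == "t":
        return "t"
    if head == "const":
        return repr(node[1])
    if head in UNARY:
        return f"{head}({to_string(node[1])})"
    return f"({to_string(node[1])} {head} {to_string(node[2])})"


def evaluate(node, t):
    """Evaluate the tree at a single time point t."""
    head = node[0]
    if head == "t":
        return t
    if head == "const":
        return node[1]
    if head in UNARY:
        return UNARY[head](evaluate(node[1], t))
    return BINARY[head](evaluate(node[1], t), evaluate(node[2], t))


def make_pair(seed, length=64, depth=3):
    """Return one (symbolic expression, time series) pair for a given seed."""
    rng = random.Random(seed)
    tree = random_expr(rng, depth)
    grid = [4.0 * math.pi * i / (length - 1) for i in range(length)]
    return to_string(tree), [evaluate(tree, t) for t in grid]


if __name__ == "__main__":
    symbol, series = make_pair(seed=0)
    print(symbol)
    print(series[:5])
```

Because the expressions are sampled rather than collected, the number of pairs is limited only by compute, which is the property the abstract relies on to sidestep data scarcity. A realistic generator would likely also need multivariate inputs, noise, and checks for degenerate expressions, none of which this sketch attempts.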
Oct-21-2025
- Country:
- North America > United States (0.67)
- Genre:
- Research Report
- New Finding (1.00)
- Experimental Study (1.00)
- Industry:
- Information Technology > Security & Privacy (0.67)
- Banking & Finance (0.67)
- Technology:
- Information Technology
- Modeling & Simulation (1.00)
- Data Science > Data Mining (1.00)
- Communications (0.92)
- Artificial Intelligence
- Representation & Reasoning (1.00)
- Natural Language > Large Language Model (1.00)
- Cognitive Science (0.92)
- Machine Learning
- Statistical Learning (1.00)
- Neural Networks > Deep Learning (1.00)