Learning to Factorize Spatio-Temporal Foundation Models
–Neural Information Processing Systems
Spatio-Temporal (ST) Foundation Models (STFMs) promise cross-dataset generalization, yet joint ST pretraining is computationally costly and struggles with domain-specific spatial correlations. To address this, we propose FactoST, a factorized STFM that decouples universal temporal pretraining from ST adaptation. The first stage trains a space-agnostic backbone via multi-task learning to capture multifrequency, cross-domain temporal patterns at low cost. The second stage attaches an lightweight adapter that rapidly adapts the backbone to specific ST domains via metadata fusion, interaction pruning, domain alignment, and memory replay. Extensive forecasting experiments show that in few-shot settings, FactoST reduces MAE by up to 46.4% versus UniST, uses 46.2% fewer parameters, achieves 68% faster inference than OpenCity, and remains competitive with expert models. This factorized view offers a practical, scalable path toward truly universal STFMs.
Neural Information Processing Systems
Jun-19-2026, 00:26:03 GMT
- Genre:
- Overview (1.00)
- Research Report
- New Finding (1.00)
- Experimental Study (1.00)
- Industry:
- Transportation (0.68)
- Energy
- Power Industry (0.67)
- Renewable > Solar (0.46)
- Technology: