UPS: Efficiently Building Foundation Models for PDE Solving via Cross-Modal Adaptation
Shen, Junhong, Marwah, Tanya, Talwalkar, Ameet
arXiv.org Artificial Intelligence
UPS embeds different PDEs into a shared representation space and processes them using an FNO-transformer architecture. Rather than training the network from scratch, which is data-demanding and computationally expensive, we warm-start the transformer from pretrained LLMs and perform explicit alignment to reduce the modality gap while improving data and compute efficiency. The cross-modal UPS achieves state-of-the-art results on a wide range of 1D and 2D PDE families from PDEBench, outperforming existing unified models while using 4 times less data and 26 times less compute. It is also capable of few-shot transfer to unseen PDE families and coefficients.
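To make the embedding idea concrete, the following is a minimal sketch of how an FNO-style spectral convolution could turn a discretized 1D PDE state into a token sequence for a transformer. It is not the authors' implementation: the function names `spectral_conv_1d` and `embed_to_tokens`, the average pooling, and all shapes are assumptions chosen for illustration, and the warm-started LLM body and the alignment step are not shown.

```python
import numpy as np


def spectral_conv_1d(u, weights):
    """FNO-style spectral convolution on a 1D signal (illustrative only).

    u:       real array of shape (channels_in, n_points)
    weights: complex array of shape (channels_in, channels_out, n_modes)
    """
    n_points = u.shape[-1]
    channels_out, n_modes = weights.shape[1], weights.shape[2]
    u_hat = np.fft.rfft(u, axis=-1)  # (channels_in, n_points // 2 + 1)
    out_hat = np.zeros((channels_out, u_hat.shape[-1]), dtype=complex)
    # Mix channels on the lowest n_modes frequencies; higher modes are dropped.
    out_hat[:, :n_modes] = np.einsum("im,iom->om", u_hat[:, :n_modes], weights)
    return np.fft.irfft(out_hat, n=n_points, axis=-1)  # (channels_out, n_points)


def embed_to_tokens(u, weights, seq_len):
    """Map a PDE state to (seq_len, embed_dim) tokens.

    Assumption: the spatial axis is average-pooled into seq_len chunks,
    so n_points must be divisible by seq_len.
    """
    features = spectral_conv_1d(u, weights)
    channels_out, n_points = features.shape
    pooled = features.reshape(channels_out, seq_len, n_points // seq_len).mean(axis=-1)
    return pooled.T  # one row per token, one column per embedding dimension


rng = np.random.default_rng(0)
u = rng.standard_normal((2, 64))  # 2 physical channels on 64 grid points
w = rng.standard_normal((2, 8, 12)) + 1j * rng.standard_normal((2, 8, 12))
tokens = embed_to_tokens(u, w, seq_len=16)
print(tokens.shape)
```

In a full model, tokens of this shape would be passed to the transformer layers. Here the embedding dimension is 8 only to keep the example small; a pretrained LLM would require its own hidden size.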
May-23-2024
- Country:
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
- Genre:
- Research Report (1.00)