Mixture-of-Experts Operator Transformer for Large-Scale PDE Pre-Training
–Neural Information Processing Systems
Pre-training has proven effective in addressing data scarcity and performance limitations in solving PDE problems with neural operators. However, challenges remain due to the heterogeneity of PDE datasets in equation types, which leads to high errors in mixed training. Additionally, dense pre-training models that scale parameters by increasing network width or depth incur significant inference costs.
Neural Information Processing Systems
Jun-11-2026, 10:04:43 GMT
- Technology: