Mixture-of-Experts Operator Transformer for Large-Scale PDE Pre-Training

Jun-11-2026, 10:04:43 GMT–Neural Information Processing Systems

Pre-training has proven effective in addressing data scarcity and performance limitations in solving PDE problems with neural operators. However, challenges remain due to the heterogeneity of PDE datasets in equation types, which leads to high errors in mixed training. Additionally, dense pre-training models that scale parameters by increasing network width or depth incur significant inference costs.

artificial intelligence, machine learning, proceedings, (6 more...)

Neural Information Processing Systems

Jun-11-2026, 10:04:43 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.82)