Differentiable modeling to unify machine learning and physical models and advance Geosciences
Shen, Chaopeng, Appling, Alison P., Gentine, Pierre, Bandai, Toshiyuki, Gupta, Hoshin, Tartakovsky, Alexandre, Baity-Jesi, Marco, Fenicia, Fabrizio, Kifer, Daniel, Li, Li, Liu, Xiaofeng, Ren, Wei, Zheng, Yi, Harman, Ciaran J., Clark, Martyn, Farthing, Matthew, Feng, Dapeng, Kumar, Praveen, Aboelyazeed, Doaa, Rahmani, Farshid, Beck, Hylke E., Bindas, Tadd, Dwivedi, Dipankar, Fang, Kuai, Höge, Marvin, Rackauckas, Chris, Roy, Tirthankar, Xu, Chonggang, Mohanty, Binayak, Lawson, Kathryn
–arXiv.org Artificial Intelligence
Process-Based Modeling (PBM) and Machine Learning (ML) are often perceived as distinct paradigms in the geosciences. Here we present differentiable geoscientific modeling as a powerful pathway toward dissolving the perceived barrier between them and ushering in a paradigm shift. For decades, PBM offered benefits in interpretability and physical consistency but struggled to efficiently leverage large datasets. ML methods, especially deep networks, presented strong predictive skills yet lacked the ability to answer specific scientific questions. While various methods have been proposed for ML-physics integration, an important underlying theme -- differentiable modeling -- is not sufficiently recognized. Here we outline the concepts, applicability, and significance of differentiable geoscientific modeling (DG). "Differentiable" refers to accurately and efficiently calculating gradients with respect to model variables, critically enabling the learning of high-dimensional unknown relationships. DG refers to a range of methods connecting varying amounts of prior knowledge to neural networks and training them together, capturing a different scope than physics-guided machine learning and emphasizing first principles. Preliminary evidence suggests DG offers better interpretability and causality than ML, improved generalizability and extrapolation capability, and strong potential for knowledge discovery, while approaching the performance of purely data-driven ML. DG models require less training data while scaling favorably in performance and efficiency with increasing amounts of data. With DG, geoscientists may be better able to frame and investigate questions, test hypotheses, and discover unrecognized linkages.
arXiv.org Artificial Intelligence
Dec-26-2023
- Country:
- Africa (0.04)
- North America
- United States
- Massachusetts (0.04)
- Maryland > Baltimore (0.04)
- Oregon (0.04)
- Iowa (0.04)
- Illinois > Champaign County
- Nebraska > Lancaster County
- Lincoln (0.14)
- New Mexico > Los Alamos County
- Los Alamos (0.04)
- Arizona > Pima County
- Tucson (0.14)
- Virginia > Fairfax County
- Reston (0.04)
- Pennsylvania > Centre County
- University Park (0.04)
- Texas > Brazos County
- College Station (0.04)
- Connecticut > Tolland County
- Storrs (0.14)
- Mississippi > Warren County
- Vicksburg (0.04)
- California
- Merced County > Merced (0.14)
- Santa Clara County > Stanford (0.04)
- Alameda County > Berkeley (0.04)
- New York > New York County
- New York City (0.04)
- Puerto Rico > Luquillo
- Luquillo (0.04)
- Canada
- Saskatchewan (0.04)
- Alberta (0.04)
- United States
- Europe
- Switzerland (0.04)
- Bosnia and Herzegovina (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Germany > Bavaria
- Upper Bavaria > Munich (0.04)
- Atlantic Ocean > North Atlantic Ocean
- Chesapeake Bay (0.04)
- Asia
- Middle East
- Jordan (0.04)
- Saudi Arabia > Mecca Province
- Thuwal (0.04)
- China > Guangdong Province
- Shenzhen (0.04)
- Middle East
- Genre:
- Research Report (1.00)
- Industry:
- Technology: