Differentiable modeling to unify machine learning and physical models and advance Geosciences
Shen, Chaopeng, Appling, Alison P., Gentine, Pierre, Bandai, Toshiyuki, Gupta, Hoshin, Tartakovsky, Alexandre, Baity-Jesi, Marco, Fenicia, Fabrizio, Kifer, Daniel, Li, Li, Liu, Xiaofeng, Ren, Wei, Zheng, Yi, Harman, Ciaran J., Clark, Martyn, Farthing, Matthew, Feng, Dapeng, Kumar, Praveen, Aboelyazeed, Doaa, Rahmani, Farshid, Beck, Hylke E., Bindas, Tadd, Dwivedi, Dipankar, Fang, Kuai, Höge, Marvin, Rackauckas, Chris, Roy, Tirthankar, Xu, Chonggang, Mohanty, Binayak, Lawson, Kathryn
–arXiv.org Artificial Intelligence
Process-Based Modeling (PBM) and Machine Learning (ML) are often perceived as distinct paradigms in the geosciences. Here we present differentiable geoscientific modeling as a powerful pathway toward dissolving the perceived barrier between them and ushering in a paradigm shift. For decades, PBM offered benefits in interpretability and physical consistency but struggled to efficiently leverage large datasets. ML methods, especially deep networks, presented strong predictive skills yet lacked the ability to answer specific scientific questions. While various methods have been proposed for ML-physics integration, an important underlying theme -- differentiable modeling -- is not sufficiently recognized. Here we outline the concepts, applicability, and significance of differentiable geoscientific modeling (DG). "Differentiable" refers to accurately and efficiently calculating gradients with respect to model variables, critically enabling the learning of high-dimensional unknown relationships. DG refers to a range of methods connecting varying amounts of prior knowledge to neural networks and training them together, capturing a different scope than physics-guided machine learning and emphasizing first principles. Preliminary evidence suggests DG offers better interpretability and causality than ML, improved generalizability and extrapolation capability, and strong potential for knowledge discovery, while approaching the performance of purely data-driven ML. DG models require less training data while scaling favorably in performance and efficiency with increasing amounts of data. With DG, geoscientists may be better able to frame and investigate questions, test hypotheses, and discover unrecognized linkages.
arXiv.org Artificial Intelligence
Dec-26-2023
- Country:
- Africa (0.04)
- Asia
- China > Guangdong Province
- Shenzhen (0.04)
- Middle East
- Jordan (0.04)
- Saudi Arabia > Mecca Province
- Thuwal (0.04)
- China > Guangdong Province
- Atlantic Ocean > North Atlantic Ocean
- Chesapeake Bay (0.04)
- Europe
- Bosnia and Herzegovina (0.04)
- Germany > Bavaria
- Upper Bavaria > Munich (0.04)
- Switzerland (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- North America
- Canada
- Alberta (0.04)
- Saskatchewan (0.04)
- Puerto Rico > Luquillo
- Luquillo (0.04)
- United States
- New York > New York County
- New York City (0.04)
- California
- Alameda County > Berkeley (0.04)
- Merced County > Merced (0.14)
- Santa Clara County > Stanford (0.04)
- Massachusetts (0.04)
- Mississippi > Warren County
- Vicksburg (0.04)
- Connecticut > Tolland County
- Storrs (0.14)
- Texas > Brazos County
- College Station (0.04)
- Iowa (0.04)
- Pennsylvania > Centre County
- University Park (0.04)
- Virginia > Fairfax County
- Reston (0.04)
- Oregon (0.04)
- Arizona > Pima County
- Tucson (0.14)
- New Mexico > Los Alamos County
- Los Alamos (0.04)
- Maryland > Baltimore (0.04)
- Nebraska > Lancaster County
- Lincoln (0.14)
- Illinois > Champaign County
- New York > New York County
- Canada
- Genre:
- Research Report (1.00)
- Industry:
- Technology: