Spatial Distribution-Shift Aware Knowledge-Guided Machine Learning

Sharma, Arun, Farhadloo, Majid, Yang, Mingzhou, Zeng, Ruolei, Ghosh, Subhankar, Shekhar, Shashi

arXiv.org Artificial Intelligence 

Given inputs of diverse soil characteristics, and climate data gathered from various regions, we aimed to build a model to predict accurate land emissions. The problem is important since accurate quantification of the carbon cycle in agroecosystems is crucial for mitigating climate change and ensuring sustainable food production. Predicting accurate land emissions is challenging due to since calibrating heterogeneous nature of soil properties, moisture, and environmental conditions is hard at decision-relevant scales. Traditional approaches do not adequately estimate land emissions due to location-independent parameters failing to leverage the spatial heterogeneity and also require large datasets. To overcome these limitations, we proposed Spatial Distribution-Shift A ware Knowledge-Guided Machine Learning (SDSA-KGML) which leverage location-dependent parameters which accounts significant spatial heterogeneity in soil moisture from multiple sites within the same region. Experimental results demonstrate that SDSA-KGML models achieve higher local accuracy for the specified states in the Midwest Region.