Collaborating Authors

Artificial intelligence locates "invisible" water in Mali and Chad


Using algorithms and artificial intelligence, a research team led by Universidad Complutense de Madrid (UCM) has designed a tool which, in its initial trials, proved capable of predicting those areas with best access to potable groundwater in Africa, with a success rate of close to 90%. In specific terms, the papers published in Hydrology and Earth System Science and Geocarto International describe the hydrogeological mapping performed by the MLMapper software in the regions of Bamako and Koulikoro (Mali) and the region of Ouaddaï (Chad), respectively. "Ensure access to water and sanitation for all" is Sustainable Development Goal 6. In sub-Saharan Africa, groundwater plays a fundamental role in the supply of drinking water, but the percentage of wells that strike water is very often lower than 30%. "This is mainly because of a lack of hydrogeological knowledge, with the practical consequence that millions of euros of humanitarian aid are lost in fruitless drilling operations", underlines Víctor Gómez-Escalonilla Canales, a researcher at UCM's Department of Geodynamics, Stratigraphy and Palaeontology.

Modern strategies for time series regression Machine Learning

This paper discusses several modern approaches to regression analysis involving time series data where some of the predictor variables are also indexed by time. We discuss classical statistical approaches as well as methods that have been proposed recently in the machine learning literature. The approaches are compared and contrasted, and it will be seen that there are advantages and disadvantages to most currently available approaches. There is ample room for methodological developments in this area. The work is motivated by an application involving the prediction of water levels as a function of rainfall and other climate variables in an aquifer in eastern Australia.

Machine learning in agricultural and applied economics


This review presents machine learning (ML) approaches from an applied economist's perspective. We first introduce the key ML methods drawing connections to econometric practice. We then identify current limitations of the econometric and simulation model toolbox in applied economics and explore potential solutions afforded by ML. We dive into cases such as inflexible functional forms, unstructured data sources and large numbers of explanatory variables in both prediction and causal analysis, and highlight the challenges of complex simulation models. Finally, we argue that economists have a vital role in addressing the shortcomings of ML when used for quantitative economic analysis. Machine learning (ML) offers great potential for expanding the applied economist's toolbox. ML tools are beginning to be employed in economic analysis (März et al., 2016; Crane-Droesch, 2017; Athey, 2019), while some researchers raise concerns about their transparency, interpretability and use for ...

Integration of adversarial autoencoders with residual dense convolutional networks for inversion of solute transport in non-Gaussian conductivity fields Machine Learning

Characterization of a non-Gaussian channelized conductivity field in subsurface flow and transport modeling through inverse modeling usually leads to a high-dimensional inverse problem and requires repeated evaluations of the forward model. In this study, we develop a convolutional adversarial autoencoder (CAAE) network to parameterize the high-dimensional non-Gaussian conductivity fields using a low-dimensional latent representation and a deep residual dense convolutional network (DRDCN) to efficiently construct a surrogate model for the forward model. The two networks are both based on a multilevel residual learning architecture called residual-in-residual dense block. The multilevel residual learning strategy and the dense connection structure in the dense block ease the training of deep networks, enabling us to efficiently build deeper networks that have an essentially increased capacity for approximating mappings of very high-complexity. The CCAE and DRDCN networks are incorporated into an iterative local updating ensemble smoother to formulate an inversion framework. The integrated method is demonstrated using a synthetic solute transport model. Results indicate that CAAE is a robust parameterization method for the channelized conductivity fields with Gaussian conductivities within each facies. The DRDCN network is able to obtain an accurate surrogate model of the forward model with high-dimensional and highly-complex concentration fields using relatively limited training data. The CAAE paramterization approach and the DRDCN surrogate method together significantly reduce the number of forward model runs required to achieve accurate inversion results.

Targeted Source Detection for Environmental Data Machine Learning

In the face of growing needs for water and energy, a fundamental understanding of the environmental impacts of human activities becomes critical for managing water and energy resources, remedying water pollution, and making regulatory policy wisely. Among activities that impact the environment, oil and gas production, wastewater transport, and urbanization are included. In addition to the occurrence of anthropogenic contamination, the presence of some contaminants (e.g., methane, salt, and sulfate) of natural origin is not uncommon. Therefore, scientists sometimes find it difficult to identify the sources of contaminants in the coupled natural and human systems. In this paper, we propose a technique to simultaneously conduct source detection and prediction, which outperforms other approaches in the interdisciplinary case study of the identification of potential groundwater contamination within a region of high-density shale gas development.