Goto

Collaborating Authors

 nbe


Neural Bayes inference for complex bivariate extremal dependence models

André, Lídia M., Wadsworth, Jennifer L., Huser, Raphaël

arXiv.org Machine Learning

Likelihood-free approaches are appealing for performing inference on complex dependence models, either because it is not possible to formulate a likelihood function, or its evaluation is very computationally costly. This is the case for several models available in the multivariate extremes literature, particularly for the most flexible tail models, including those that interpolate between the two key dependence classes of `asymptotic dependence' and `asymptotic independence'. We focus on approaches that leverage neural networks to approximate Bayes estimators. In particular, we explore the properties of neural Bayes estimators for parameter inference for several flexible but computationally expensive models to fit, with a view to aiding their routine implementation. Owing to the absence of likelihood evaluation in the inference procedure, classical information criteria such as the Bayesian information criterion cannot be used to select the most appropriate model. Instead, we propose using neural networks as neural Bayes classifiers for model selection. Our goal is to provide a toolbox for simple, fast fitting and comparison of complex extreme-value dependence models, where the best model is selected for a given data set and its parameters subsequently estimated using neural Bayes estimation. We apply our classifiers and estimators to analyse the pairwise extremal behaviour of changes in horizontal geomagnetic field fluctuations at three different locations.


Neural Parameter Estimation with Incomplete Data

Sainsbury-Dale, Matthew, Zammit-Mangion, Andrew, Cressie, Noel, Huser, Raphaël

arXiv.org Machine Learning

Advancements in artificial intelligence (AI) and deep learning have led to neural networks being used to generate lightning-speed answers to complex questions, to paint like Monet, or to write like Proust. Leveraging their computational speed and flexibility, neural networks are also being used to facilitate fast, likelihood-free statistical inference. However, it is not straightforward to use neural networks with data that for various reasons are incomplete, which precludes their use in many applications. A recently proposed approach to remedy this issue inputs an appropriately padded data vector and a vector that encodes the missingness pattern to a neural network. While computationally efficient, this "masking" approach can result in statistically inefficient inferences. Here, we propose an alternative approach that is based on the Monte Carlo expectation-maximization (EM) algorithm. Our EM approach is likelihood-free, substantially faster than the conventional EM algorithm as it does not require numerical optimization at each iteration, and more statistically efficient than the masking approach. This research represents a prototype problem that indicates how improvements could be made in AI by introducing Bayesian statistical thinking. We compare the two approaches to missingness using simulated incomplete data from two models: a spatial Gaussian process model, and a spatial Potts model. The utility of the methodology is shown on Arctic sea-ice data and cryptocurrency data.


Dancing in the Shadows: Harnessing Ambiguity for Fairer Classifiers

Barrainkua, Ainhize, Gordaliza, Paula, Lozano, Jose A., Quadrianto, Novi

arXiv.org Artificial Intelligence

Algorithmic systems, designed to streamline decision processes and enhance efficiency, have permeated virtually every aspect of our lives. From credit approvals to hiring decisions, from predictive policing to healthcare recommendations, algorithms wield significant influence. Yet, this influence is not neutral, and the consequences could be disproportionate for diverse communities. Subtle biases embedded in training data, the choices made during model development, and the very nature of algorithmic decision-making are some potential reasons for inequitable treatment of certain demographic groups, perpetuating and, in some instances, exacerbating societal disparities. Consider, for instance, the use of predictive policing algorithms, where certain communities are subjected to heightened surveillance based on historical crime data, perpetuating a cycle of over-policing [9]. Similarly, in hiring practices, algorithms may inadvertently favor certain demographics, leading to underrepresentation and reinforcing existing inequalities in the workplace [6, 5]. Therefore, it is crucial to acknowledge the inherent biases and disparities that have emerged within these systems and propose innovative solutions to enhance their fairness guarantees.


Neural Bayes estimators for censored inference with peaks-over-threshold models

Richards, Jordan, Sainsbury-Dale, Matthew, Zammit-Mangion, Andrew, Huser, Raphaël

arXiv.org Machine Learning

Making inference with spatial extremal dependence models can be computationally burdensome since they involve intractable and/or censored likelihoods. Building on recent advances in likelihood-free inference with neural Bayes estimators, that is, neural networks that approximate Bayes estimators, we develop highly efficient estimators for censored peaks-over-threshold models that encode censoring information in the neural network architecture. Our new method provides a paradigm shift that challenges traditional censored likelihood-based inference methods for spatial extremal dependence models. Our simulation studies highlight significant gains in both computational and statistical efficiency, relative to competing likelihood-based approaches, when applying our novel estimators to make inference with popular extremal dependence models, such as max-stable, $r$-Pareto, and random scale mixture process models. We also illustrate that it is possible to train a single neural Bayes estimator for a general censoring level, precluding the need to retrain the network when the censoring level is changed. We illustrate the efficacy of our estimators by making fast inference on hundreds-of-thousands of high-dimensional spatial extremal dependence models to assess extreme particulate matter 2.5 microns or less in diameter (PM2.5) concentration over the whole of Saudi Arabia.