Goto

Collaborating Authors

 South America


Depth Uncertainty in Neural Networks

arXiv.org Machine Learning

Existing methods for estimating uncertainty in deep learning tend to require multiple forward passes, making them unsuitable for applications where computational resources are limited. To solve this, we perform probabilistic reasoning over the depth of neural networks. Different depths correspond to subnetworks which share weights and whose predictions are combined via marginalisation, yielding model uncertainty. By exploiting the sequential structure of feed-forward networks, we are able to both evaluate our training objective and make predictions with a single forward pass. We validate our approach on real-world regression and image classification tasks. Our approach provides uncertainty calibration, robustness to dataset shift, and accuracies competitive with more computationally expensive baselines.


A novel embedded min-max approach for feature selection in nonlinear support vector machine classification

arXiv.org Machine Learning

In recent years, feature selection has become a challenging problem in several machine learning fields, such as classification problems. Support Vector Machine (SVM) is a well-known technique applied in classification tasks. Various methodologies have been proposed in the literature to select the most relevant features in SVM. Unfortunately, all of them either deal with the feature selection problem in the linear classification setting or propose ad-hoc approaches that are difficult to implement in practice. In contrast, we propose an embedded feature selection method based on a min-max optimization problem, where a trade-off between model complexity and classification accuracy is sought. By leveraging duality theory, we equivalently reformulate the min-max problem and solve it without further ado using off-the-shelf software for nonlinear optimization. The efficiency and usefulness of our approach are tested on several benchmark data sets in terms of accuracy, number of selected features and interpretability.


Exemplary Natural Images Explain CNN Activations Better than Feature Visualizations

arXiv.org Artificial Intelligence

Feature visualizations such as synthetic maximally activating images are a widely used explanation method to better understand the information processing of convolutional neural networks (CNNs). At the same time, there are concerns that these visualizations might not accurately represent CNNs' inner workings. Here, we measure how much extremely activating images help humans to predict CNN activations. Using a well-controlled psychophysical paradigm, we compare the informativeness of synthetic images (Olah et al., 2017) with a simple baseline visualization, namely exemplary natural images that also strongly activate a specific feature map. Given either synthetic or natural reference images, human participants choose which of two query images leads to strong positive activation. The experiment is designed to maximize participants' performance, and is the first to probe intermediate instead of final layer representations. We find that synthetic images indeed provide helpful information about feature map activations (82% accuracy; chance would be 50%). However, natural images-originally intended to be a baseline-outperform synthetic images by a wide margin (92% accuracy). Additionally, participants are faster and more confident for natural images, whereas subjective impressions about the interpretability of feature visualization are mixed. The higher informativeness of natural images holds across most layers, for both expert and lay participants as well as for hand- and randomly-picked feature visualizations. Even if only a single reference image is given, synthetic images provide less information than natural images (65% vs. 73%). In summary, popular synthetic images from feature visualizations are significantly less informative for assessing CNN activations than natural images. We argue that future visualization methods should improve over this simple baseline.


A Software Architecture for Autonomous Vehicles: Team LRM-B Entry in the First CARLA Autonomous Driving Challenge

arXiv.org Artificial Intelligence

The objective of the first CARLA autonomous driving challenge was to deploy autonomous driving systems to lead with complex traffic scenarios where all participants faced the same challenging traffic situations. According to the organizers, this competition emerges as a way to democratize and to accelerate the research and development of autonomous vehicles around the world using the CARLA simulator contributing to the development of the autonomous vehicle area. Therefore, this paper presents the architecture design for the navigation of an autonomous vehicle in a simulated urban environment that attempts to commit the least number of traffic infractions, which used as the baseline the original architecture of the platform for autonomous navigation CaRINA 2. Our agent traveled in simulated scenarios for several hours, demonstrating his capabilities, winning three out of the four tracks of the challenge, and being ranked second in the remaining track. Our architecture was made towards meeting the requirements of CARLA Autonomous Driving Challenge and has components for obstacle detection using 3D point clouds, traffic signs detection and classification which employs Convolutional Neural Networks (CNN) and depth information, risk assessment with collision detection using short-term motion prediction, decision-making with Markov Decision Process (MDP), and control using Model Predictive Control (MPC).


Model Interpretability through the Lens of Computational Complexity

arXiv.org Artificial Intelligence

In spite of several claims stating that some models are more interpretable than others -- e.g., "linear models are more interpretable than deep neural networks" -- we still lack a principled notion of interpretability to formally compare among different classes of models. We make a step towards such a notion by studying whether folklore interpretability claims have a correlate in terms of computational complexity theory. We focus on local post-hoc explainability queries that, intuitively, attempt to answer why individual inputs are classified in a certain way by a given model. In a nutshell, we say that a class $\mathcal{C}_1$ of models is more interpretable than another class $\mathcal{C}_2$, if the computational complexity of answering post-hoc queries for models in $\mathcal{C}_2$ is higher than for those in $\mathcal{C}_1$. We prove that this notion provides a good theoretical counterpart to current beliefs on the interpretability of models; in particular, we show that under our definition and assuming standard complexity-theoretical assumptions (such as P$\neq$NP), both linear and tree-based models are strictly more interpretable than neural networks. Our complexity analysis, however, does not provide a clear-cut difference between linear and tree-based models, as we obtain different results depending on the particular post-hoc explanations considered. Finally, by applying a finer complexity analysis based on parameterized complexity, we are able to prove a theoretical result suggesting that shallow neural networks are more interpretable than deeper ones.


Algorithms Are Making Economic Inequality Worse

#artificialintelligence

The risks of algorithmic discrimination and bias have received much attention and scrutiny, and rightly so. Yet there is another more insidious side-effect of our increasingly AI-powered society -- the systematic inequality created by the changing nature of work itself. We fear a future where robots take our jobs, but what happens when a significant portion of the workforce ends up in algorithmically managed jobs with little future and few possibilities for advancement? One of the classic tropes of self-made success is the leader who comes from humble beginnings, working their way up from the mailroom, the cash register, or the factory floor. And while doing that is considerably tougher than Hollywood might suggest, bottom-up mobility was at least possible in traditional organizations.


Inborn errors of type I IFN immunity in patients with life-threatening COVID-19

Science

The immune system is complex and involves many genes, including those that encode cytokines known as interferons (IFNs). Individuals that lack specific IFNs can be more susceptible to infectious diseases. Furthermore, the autoantibody system dampens IFN response to prevent damage from pathogen-induced inflammation. Two studies now examine the likelihood that genetics affects the risk of severe coronavirus disease 2019 (COVID-19) through components of this system (see the Perspective by Beck and Aksentijevich). Q. Zhang et al. used a candidate gene approach and identified patients with severe COVID-19 who have mutations in genes involved in the regulation of type I and III IFN immunity. They found enrichment of these genes in patients and conclude that genetics may determine the clinical course of the infection. Bastard et al. identified individuals with high titers of neutralizing autoantibodies against type I IFN-α2 and IFN-ω in about 10% of patients with severe COVID-19 pneumonia. These autoantibodies were not found either in infected people who were asymptomatic or had milder phenotype or in healthy individuals. Together, these studies identify a means by which individuals at highest risk of life-threatening COVID-19 can be identified. Science , this issue p. [eabd4570][1], p. [eabd4585][2]; see also p. [404][3] ### INTRODUCTION Clinical outcomes of human severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection range from silent infection to lethal coronavirus disease 2019 (COVID-19). Epidemiological studies have identified three risk factors for severe disease: being male, being elderly, and having other medical conditions. However, interindividual clinical variability remains huge in each demographic category. Discovering the root cause and detailed molecular, cellular, and tissue- and body-level mechanisms underlying life-threatening COVID-19 is of the utmost biological and medical importance. ### RATIONALE We established the COVID Human Genetic Effort ([www.covidhge.com][4]) to test the general hypothesis that life-threatening COVID-19 in some or most patients may be caused by monogenic inborn errors of immunity to SARS-CoV-2 with incomplete or complete penetrance. We sequenced the exome or genome of 659 patients of various ancestries with life-threatening COVID-19 pneumonia and 534 subjects with asymptomatic or benign infection. We tested the specific hypothesis that inborn errors of Toll-like receptor 3 (TLR3)– and interferon regulatory factor 7 (IRF7)–dependent type I interferon (IFN) immunity that underlie life-threatening influenza pneumonia also underlie life-threatening COVID-19 pneumonia. We considered three loci identified as mutated in patients with life-threatening influenza: TLR3 , IRF7 , and IRF9 . We also considered 10 loci mutated in patients with other viral illnesses but directly connected to the three core genes conferring influenza susceptibility: TICAM1/TRIF , UNC93B1 , TRAF3 , TBK1 , IRF3 , and NEMO/IKBKG from the TLR3-dependent type I IFN induction pathway, and IFNAR1 , IFNAR2 , STAT1 , and STAT2 from the IRF7- and IRF9-dependent type I IFN amplification pathway. Finally, we considered various modes of inheritance at these 13 loci. ### RESULTS We found an enrichment in variants predicted to be loss-of-function (pLOF), with a minor allele frequency <0.001, at the 13 candidate loci in the 659 patients with life-threatening COVID-19 pneumonia relative to the 534 subjects with asymptomatic or benign infection ( P = 0.01). Experimental tests for all 118 rare nonsynonymous variants (including both pLOF and other variants) of these 13 genes found in patients with critical disease identified 23 patients (3.5%), aged 17 to 77 years, carrying 24 deleterious variants of eight genes. These variants underlie autosomal-recessive (AR) deficiencies ( IRF7 and IFNAR1 ) and autosomal-dominant (AD) deficiencies ( TLR3 , UNC93B1 , TICAM1 , TBK1 , IRF3 , IRF7 , IFNAR1 , and IFNAR2 ) in four and 19 patients, respectively. These patients had never been hospitalized for other life-threatening viral illness. Plasmacytoid dendritic cells from IRF7-deficient patients produced no type I IFN on infection with SARS-CoV-2, and TLR3−/−, TLR3+/−, IRF7−/−, and IFNAR1−/− fibroblasts were susceptible to SARS-CoV-2 infection in vitro. ### CONCLUSION At least 3.5% of patients with life-threatening COVID-19 pneumonia had known (AR IRF7 and IFNAR1 deficiencies or AD TLR3, TICAM1, TBK1, and IRF3 deficiencies) or new (AD UNC93B1, IRF7, IFNAR1, and IFNAR2 deficiencies) genetic defects at eight of the 13 candidate loci involved in the TLR3- and IRF7-dependent induction and amplification of type I IFNs. This discovery reveals essential roles for both the double-stranded RNA sensor TLR3 and type I IFN cell-intrinsic immunity in the control of SARS-CoV-2 infection. Type I IFN administration may be of therapeutic benefit in selected patients, at least early in the course of SARS-CoV-2 infection. ![Figure][5] Inborn errors of TLR3- and IRF7-dependent type I IFN production and amplification underlie life-threatening COVID-19 pneumonia. Molecules in red are encoded by core genes, deleterious variants of which underlie critical influenza pneumonia with incomplete penetrance, and deleterious variants of genes encoding biochemically related molecules in blue underlie other viral illnesses. Molecules represented in bold are encoded by genes with variants that also underlie critical COVID-19 pneumonia. Clinical outcome upon infection with severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) ranges from silent infection to lethal coronavirus disease 2019 (COVID-19). We have found an enrichment in rare variants predicted to be loss-of-function (LOF) at the 13 human loci known to govern Toll-like receptor 3 (TLR3)– and interferon regulatory factor 7 (IRF7)–dependent type I interferon (IFN) immunity to influenza virus in 659 patients with life-threatening COVID-19 pneumonia relative to 534 subjects with asymptomatic or benign infection. By testing these and other rare variants at these 13 loci, we experimentally defined LOF variants underlying autosomal-recessive or autosomal-dominant deficiencies in 23 patients (3.5%) 17 to 77 years of age. We show that human fibroblasts with mutations affecting this circuit are vulnerable to SARS-CoV-2. Inborn errors of TLR3- and IRF7-dependent type I IFN immunity can underlie life-threatening COVID-19 pneumonia in patients with no prior severe infection. [1]: /lookup/doi/10.1126/science.abd4570 [2]: /lookup/doi/10.1126/science.abd4585 [3]: /lookup/doi/10.1126/science.abe7591 [4]: https://www.covidhge.com [5]: pending:yes


Intel Powers First Satellite with AI on Board

#artificialintelligence

As ubiquitous as artificial intelligence has become in modern life -- from boosting our understanding of the cosmos to surfacing entertaining videos on your phone -- AI hasn't yet found its way into orbit. That is until Sept. 2, when an experimental satellite about the size of a cereal box was ejected from a rocket's dispenser along with 45 other similarly small satellites. The satellite, named PhiSat-1, is now soaring at over 17,000 mph (27,500 kmh) in sun-synchronous orbit about 329 miles (530 km) overhead. PhiSat-1 contains a new hyperspectral-thermal camera and onboard AI processing thanks to an Intel Movidius Myriad 2 Vision Processing Unit (VPU) -- the same chip inside many smart cameras and even a $99 selfie drone here on Earth. PhiSat-1 is actually one of a pair of satellites on a mission to monitor polar ice and soil moisture, while also testing intersatellite communication systems in order to create a future network of federated satellites.


Achieving User-Side Fairness in Contextual Bandits

arXiv.org Artificial Intelligence

Personalized recommendation based on multi-arm bandit (MAB) algorithms has shown to lead to high utility and efficiency as it can dynamically adapt the recommendation strategy based on feedback. However, unfairness could incur in personalized recommendation. In this paper, we study how to achieve user-side fairness in personalized recommendation. We formulate our fair personalized recommendation as a modified contextual bandit and focus on achieving fairness on the individual whom is being recommended an item as opposed to achieving fairness on the items that are being recommended. We introduce and define a metric that captures the fairness in terms of rewards received for both the privileged and protected groups. We develop a fair contextual bandit algorithm, Fair-LinUCB, that improves upon the traditional LinUCB algorithm to achieve group-level fairness of users. Our algorithm detects and monitors unfairness while it learns to recommend personalized videos to students to achieve high efficiency. We provide a theoretical regret analysis and show that our algorithm has a slightly higher regret bound than LinUCB. We conduct numerous experimental evaluations to compare the performances of our fair contextual bandit to that of LinUCB and show that our approach achieves group-level fairness while maintaining a high utility.


Kernel Smoothing, Mean Shift, and Their Learning Theory with Directional Data

arXiv.org Machine Learning

Directional data consist of observations distributed on a (hyper)sphere, and appear in many applied fields, such as astronomy, ecology, and environmental science. This paper studies both statistical and computational problems of kernel smoothing for directional data. We generalize the classical mean shift algorithm to directional data, which allows us to identify local modes of the directional kernel density estimator (KDE). The statistical convergence rates of the directional KDE and its derivatives are derived, and the problem of mode estimation is examined. We also prove the ascending property of our directional mean shift algorithm and investigate a general problem of gradient ascent on the unit hypersphere. To demonstrate the applicability of our proposed algorithm, we evaluate it as a mode clustering method on both simulated and real-world datasets.