Goto

Collaborating Authors

 South America


IBM Launches Artificial Intelligence Centre In Brazil

#artificialintelligence

Introduced in 2019, by IBM, Brazil has launched the largest research facility, that focuses on artificial intelligence, through a collaboration between the private and public sector. The Artificial Intelligence Center (C4AI) is supported by investments made by IBM along with the São Paulo Research Foundation (FAPESP) and the University of São Paulo (USP). This AI centre -- C4AI has been established to tackle five significant challenges that are related to health, the environment, the food production chain, the future of work and the development of NLP technologies in Portuguese. Along with this, it will also aid in projects relating to human wellbeing improvement as well as initiatives focused on diversity and inclusion. The total investment in the AI centre will reach $20 million over the next ten years, which will be split among the investors. The USP will contribute $1 million to cover costs related to the physical set-up of the space, as well as over 70 lecturers and staff to run the centre.


6 Future Trends Everyone Has To Be Ready For Today

#artificialintelligence

I had the pleasure of talking with futurist and the managing partner of ChangeistScott Smith recently about some of the biggest macro trends everyone should be aware of today. While these trends had already begun prior to the coronavirus pandemic, in many ways, they accelerated as the world fought to deal with the pandemic and now as we begin to build our post-COVID-19 world. Here are the six future trends he believes everyone should be ready for. The "decoupling" of economies had already started pre-COVID-19 with early indicators appearing five to 10 years ago, according to some thought leaders, but the pandemic certainly made it more clear how dependence on globalization could create vulnerabilities. Some of the world's major powers, such as the UK, the United States, Brazil, Russia, India, and parts of the European Union, had already started to favor nationalism.


Towards Heterogeneous Multi-Agent Reinforcement Learning with Graph Neural Networks

arXiv.org Artificial Intelligence

This work proposes a neural network architecture that learns policies for multiple agent classes in a heterogeneous multi-agent reinforcement setting. The proposed network uses directed labeled graph representations for states, encodes feature vectors of different sizes for different entity classes, uses relational graph convolution layers to model different communication channels between entity types and learns distinct policies for different agent classes, sharing parameters wherever possible. Results have shown that specializing the communication channels between entity classes is a promising step to achieve higher performance in environments composed of heterogeneous entities.


An Eager Splitting Strategy for Online Decision Trees

arXiv.org Machine Learning

We study the effectiveness of replacing the split strategy for the state-of-the-art online tree learner, Hoeffding Tree, with a rigorous but more eager splitting strategy. Our method, Hoeffding AnyTime Tree (HATT), uses the Hoeffding Test to determine whether the current best candidate split is superior to the current split, with the possibility of revision, while Hoeffding Tree aims to determine whether the top candidate is better than the second best and fixes it for all posterity. Our method converges to the ideal batch tree while Hoeffding Tree does not. Decision tree ensembles are widely used in practice, and in this work, we study the efficacy of HATT as a base learner for online bagging and online boosting ensembles. On UCI and synthetic streams, the success of Hoeffding AnyTime Tree in terms of prequential accuracy over Hoeffding Tree is established. HATT as a base learner component outperforms HT within a 0.05 significance level for the majority of tested ensembles on what we believe is the largest and most comprehensive set of testbenches in the online learning literature. Our results indicate that HATT is a superior alternative to Hoeffding Tree in a large number of ensemble settings.


Deep Importance Sampling based on Regression for Model Inversion and Emulation

arXiv.org Machine Learning

Understanding systems by forward and inverse modeling is a recurrent topic of research in many domains of science and engineering. In this context, Monte Carlo methods have been widely used as powerful tools for numerical inference and optimization. They require the choice of a suitable proposal density that is crucial for their performance. For this reason, several adaptive importance sampling (AIS) schemes have been proposed in the literature. We here present an AIS framework called Regression-based Adaptive Deep Importance Sampling (RADIS). In RADIS, the key idea is the adaptive construction via regression of a non-parametric proposal density (i.e., an emulator), which mimics the posterior distribution and hence minimizes the mismatch between proposal and target densities. RADIS is based on a deep architecture of two (or more) nested IS schemes, in order to draw samples from the constructed emulator. The algorithm is highly efficient since employs the posterior approximation as proposal density, which can be improved adding more support points. As a consequence, RADIS asymptotically converges to an exact sampler under mild conditions. Additionally, the emulator produced by RADIS can be in turn used as a cheap surrogate model for further studies. We introduce two specific RADIS implementations that use Gaussian Processes (GPs) and Nearest Neighbors (NN) for constructing the emulator. Several numerical experiments and comparisons show the benefits of the proposed schemes. A real-world application in remote sensing model inversion and emulation confirms the validity of the approach.


On the Adversarial Robustness of LASSO Based Feature Selection

arXiv.org Machine Learning

In this paper, we investigate the adversarial robustness of feature selection based on the $\ell_1$ regularized linear regression model, namely LASSO. In the considered model, there is a malicious adversary who can observe the whole dataset, and then will carefully modify the response values or the feature matrix in order to manipulate the selected features. We formulate the modification strategy of the adversary as a bi-level optimization problem. Due to the difficulty of the non-differentiability of the $\ell_1$ norm at the zero point, we reformulate the $\ell_1$ norm regularizer as linear inequality constraints. We employ the interior-point method to solve this reformulated LASSO problem and obtain the gradient information. Then we use the projected gradient descent method to design the modification strategy. In addition, We demonstrate that this method can be extended to other $\ell_1$ based feature selection methods, such as group LASSO and sparse group LASSO. Numerical examples with synthetic and real data illustrate that our method is efficient and effective.


Taming Discrete Integration via the Boon of Dimensionality

arXiv.org Artificial Intelligence

Discrete integration is a fundamental problem in computer science that concerns the computation of discrete sums over exponentially large sets. Despite intense interest from researchers for over three decades, the design of scalable techniques for computing estimates with rigorous guarantees for discrete integration remains the holy grail. The key contribution of this work addresses this scalability challenge via an efficient reduction of discrete integration to model counting. The proposed reduction is achieved via a significant increase in the dimensionality that, contrary to conventional wisdom, leads to solving an instance of the relatively simpler problem of model counting. Building on the promising approach proposed by Chakraborty et al [9], our work overcomes the key weakness of their approach: a restriction to dyadic weights. We augment our proposed reduction, called DeWeight, with a state of the art efficient approximate model counter and perform detailed empirical analysis over benchmarks arising from neural network verification domains, an emerging application area of critical importance. DeWeight, to the best of our knowledge, is the first technique to compute estimates with provable guarantees for this class of benchmarks.


A Survey on Deep Learning and Explainability for Automatic Image-based Medical Report Generation

arXiv.org Artificial Intelligence

Research over the last five years shows a clear improvement in computer-aided detection (CAD), specifically in disease prediction from medical images [36, 61, 105, 130, 134] as well as from Electronic Health Records (EHR) [113], by using deep neural networks (DNN) and treating the problem as supervised classification or segmentation tasks. Recently, Topol [129] indicates that the need for diagnosis and reporting from image-based examinations far exceeds the current medical capacity of physicians in the US. This situation promotes the development of automatic image-based diagnosis as well as automatic reporting. Furthermore, the lack of specialist physicians is even more critical in resource-limited countries [111], and therefore the expected impacts of this technology would become even more relevant. However, the elaboration of high-quality medical reports from medical images, such as chest X-rays, computed tomography (CT) or magnetic resonance (MRI) scans, is a task that requires a trained radiologist with years of experience.


Modeling Content and Context with Deep Relational Learning

arXiv.org Artificial Intelligence

Building models for realistic natural language tasks requires dealing with long texts and accounting for complicated structural dependencies. Neural-symbolic representations have emerged as a way to combine the reasoning capabilities of symbolic methods, with the expressiveness of neural networks. However, most of the existing frameworks for combining neural and symbolic representations have been designed for classic relational learning tasks that work over a universe of symbolic entities and relations. In this paper, we present DRaiL, an open-source declarative framework for specifying deep relational models, designed to support a variety of NLP scenarios. Our framework supports easy integration with expressive language encoders, and provides an interface to study the interactions between representation, inference and learning.


Open Question Answering over Tables and Text

arXiv.org Artificial Intelligence

In open question answering (QA), the answer to a question is produced by retrieving and then analyzing documents that might contain answers to the question. Most open QA systems have considered only retrieving information from unstructured text. Here we consider for the first time open QA over both tabular and textual data and present a new large-scale dataset Open Table-Text Question Answering (OTT-QA) to evaluate performance on this task. Most questions in OTT-QA require multi-hop inference across tabular data and unstructured text, and the evidence required to answer a question can be distributed in different ways over these two types of input, making evidence retrieval challenging---our baseline model using an iterative retriever and BERT-based reader achieves an exact match score less than 10%. We then propose two novel techniques to address the challenge of retrieving and aggregating evidence for OTT-QA. The first technique is to use "early fusion" to group multiple highly relevant tabular and textual units into a fused block, which provides more context for the retriever to search for. The second technique is to use a cross-block reader to model the cross-dependency between multiple retrieved evidences with global-local sparse attention. Combining these two techniques improves the score significantly, to above 27%.