Goto

Collaborating Authors

 South America


Text Classification of the Precursory Accelerating Seismicity Corpus: Inference on some Theoretical Trends in Earthquake Predictability Research from 1988 to 2018

arXiv.org Machine Learning

Text analytics based on supervised machine learning classifiers has shown great promise in a multitude of domains, but has yet to be applied to Seismology. We test various standard models (Naive Bayes, k-Nearest Neighbors, Support Vector Machines, and Random Forests) on a seismological corpus of 100 articles related to the topic of precursory accelerating seismicity, spanning from 1988 to 2010. This corpus was labelled in Mignan (2011) with the precursor whether explained by critical processes (i.e., cascade triggering) or by other processes (such as signature of main fault loading). We investigate rather the classification process can be automatized to help analyze larger corpora in order to better understand trends in earthquake predictability research. We find that the Naive Bayes model performs best, in agreement with the machine learning literature for the case of small datasets, with cross-validation accuracies of 86% for binary classification. For a refined multiclass classification ('non-critical process' < 'agnostic' < 'critical process assumed' < 'critical process demonstrated'), we obtain up to 78% accuracy. Prediction on a dozen of articles published since 2011 shows however a weak generalization with a F1-score of 60%, only slightly better than a random classifier, which can be explained by a change of authorship and use of different terminologies. Yet, the model shows F1-scores greater than 80% for the two multiclass extremes ('non-critical process' versus 'critical process demonstrated') while it falls to random classifier results (around 25%) for papers labelled 'agnostic' or 'critical process assumed'. Those results are encouraging in view of the small size of the corpus and of the high degree of abstraction of the labelling. Domain knowledge engineering remains essential but can be made transparent by an investigation of Naive Bayes keyword posterior probabilities.


Current Trends and Future Research Directions for Interactive Music

arXiv.org Artificial Intelligence

Technology has shaped the way on which we compose and produce music: Notably, the invention of microphones, magnetic tapes, amplifiers and computers pushed the development of new music styles in the 20th century. In fact, several artistic domains have been benefiting from such technology developments; for instance, Experimental music, nonlinear music, Electroacoustic music, and interactive music. Experimental music is composed in such a way that its outcome is often unforeseeable; for instance, it may contain random generated tones, computer-generated content, variable-duration notes and "free" content. It may also include atonal melodies and microtones. Another domain is nonlinear music, in which the scenario is divided in parts whose order can be chosen at execution time. We will use the term "nonlinear" music in that sense. Nonlinear music exists from many centuries ago; for instance, Mozart's minuets in which the order of work's musical material was determined by coin-tosses. Electroacoustic music was originated by the incorporation of electronic sound production into compositional practice.


Map Memorization and Forgetting in the IARA Autonomous Car

arXiv.org Artificial Intelligence

Abstract--In this work, we present a novel strategy for correcting imperfections in occupancy grid maps called map decay. The objective of map decay is to correct invalid occupancy probabilities of map cells that are unobservable by sensors. The strategy was inspired by an analogy between the memory architecture believed to exist in the human brain and the maps maintained by an autonomous vehicle. It consists in merging sensory information obtained during runtime (online) with a priori data from a high-precision map constructed offline. In map decay, cells observed by sensors are updated using traditional occupancy grid mapping techniques and unobserved cells are adjusted so that their occupancy probabilities tend to the values found in the offline map. This strategy is grounded in the idea that the most precise information available about an unobservable cell is the value found in the high-precision offline map. Map decay was successfully tested and is still in use in the IARA autonomous vehicle from Universidade Federal do Espírito Santo. The brain allows humans to operate in highly dynamic and complex environments, and to solve general purpose problems. The idea of giving these abilities to artificial entities by reproducing the brain's cognitive processes always fascinated researchers.


Multitask Learning on Graph Neural Networks - Learning Multiple Graph Centrality Measures with a Unified Network

arXiv.org Artificial Intelligence

The application of deep learning to symbolic domains remains an active research endeavour. Graph neural networks (GNN), consisting of trained neural modules which can be arranged in different topologies at run time, are sound alternatives to tackle relational problems which lend themselves to graph representations. In this paper, we show that GNNs are capable of multitask learning, which can be naturally enforced by training the model to refine a single set of multidimensional embeddings $\in \mathbb{R}^d$ and decode them into multiple outputs by connecting MLPs at the end of the pipeline. We demonstrate the multitask learning capability of the model in the relevant relational problem of estimating network centrality measures, i.e. is vertex $v_1$ more central than vertex $v_2$ given centrality $c$?. We then show that a GNN can be trained to develop a $lingua$ $franca$ of vertex embeddings from which all relevant information about any of the trained centrality measures can be decoded. The proposed model achieves $89\%$ accuracy on a test dataset of random instances with up to 128 vertices and is shown to generalise to larger problem sizes. The model is also shown to obtain reasonable accuracy on a dataset of real world instances with up to 4k vertices, vastly surpassing the sizes of the largest instances with which the model was trained ($n=128$). Finally, we believe that our contributions attest to the potential of GNNs in symbolic domains in general and in relational learning in particular.


Feature Selection Approach with Missing Values Conducted for Statistical Learning: A Case Study of Entrepreneurship Survival Dataset

arXiv.org Machine Learning

In this article, we investigate the features which enhanced discriminate the survival in the micro and small business (MSE) using the approach of data mining with feature selection. According to the complexity of the data set, we proposed a comparison of three data imputation methods such as mean imputation (MI), k-nearest neighbor (KNN) and expectation maximization (EM) using mutually the selection of variables technique, whereby t-test, then through the data mining process using logistic regression classification methods, naive Bayes algorithm, linear discriminant analysis and support vector machine hence comparing their respective performances. The experimental results will be spread in developing a model to predict the MSE survival, providing a better understanding in the topic once it is a significant part of the Brazilian' GPA and macroeconomy.


Study: PR pros are optimistic about AI

#artificialintelligence

NEW YORK: Communicators are bullish about artificial intelligence's potential to make their profession more efficient, according to a survey by MSLGroup and Publicis.Sapient. The study, Powered by AI: Communications in the Algorithm Age, surveyed more than 1,846 marketing and communications in-house leaders in Brazil, China, France, Germany, India, Italy, Poland, the U.K., and the U.S. MSL's parent, Publicis Groupe, is rolling out an AI platform called Marcel, which will essentially help its employees better collaborate and form teams more efficiently. MSL CEO Guillaume Herbette said he expects Marcel to be 100% operational by the end of the year. "AI will help us and our clients do better work," Herbette said. "It will add to brand-consumer engagement. They will try more than ever at [maintaining] that relationship."


Coming of Age: Emerging Technologies And The World's Children

#artificialintelligence

Read "technology" and "children" in the same sentence, and you'll probably think about screen time or social media. But technology's implications are vastly more profound: AI, machine learning, big data and automation will fundamentally reshape the lives of our youngest generation. How might we direct the power of emerging innovations to fulfill their rights? One trailblazer addressing this question is Erica Kochi, Co-Founder of UNICEF Innovation at the United Nations Children's Fund, who was named one of TIME's most influential people in the world. Erica continues to accelerate action – unveiling a new urban tech bets opportunity just this week – and to drive crucial dialogues as Co-Chair of the World Economic Forum's Global Future Council on Human Rights.


SAP Forum Brazil 2018: Intelligent Enterprise and Augmented Humanity

#artificialintelligence

On September 11 and 12, 2018, I attended the 22nd edition of the SAP Forum in São Paulo, Brazil, the largest technology and business event in Latin America, with superlative numbers: more than 15,600 participants, more than 9,600 unique participants, 81 sponsors and 423 presentations. It was two intense days, with a lot of technology and innovation, which showed the importance of digital transformation for business. At the opening session of the event, SAP Brazil president Cristina Palmaka set the tone for what would be the event, talking about the interaction between man and machine, being aligned with SAP's purpose of making companies better, impacting the society. And she asked the following questions for reflection: What is the purpose of the company you work for? What would happen in society if tomorrow your company ceased to exist, or if your employment ceased to exist?


Interactive Learning with Corrective Feedback for Policies based on Deep Neural Networks

arXiv.org Machine Learning

Deep Reinforcement Learning (DRL) has obtained unprecedented results in decisionmaking problems, such as playing Atari games [1], or beating the world champion in GO [2]. Nevertheless, in robotic problems, DRL is still limited in applications with real-world systems [3]. Most of the tasks that have been successfully addressed with DRL have two common characteristics: 1) they have well-specified reward functions, and 2) they require large amounts of trials, which means long training periods (or powerful computers) to obtain a satisfying behavior. These two characteristics can be problematic in cases where 1) the goals of the tasks are poorly defined or hard to specify/model (reward function does not exist), 2) the execution of many trials is not feasible (real systems case) and/or not much computational power or time is available, and 3) sometimes additional external perception is necessary for computing the reward/cost function. On the other hand, Machine Learning methods that rely on transfer of human knowledge, Interactive Machine Learning (IML) methods, have shown to be time efficient for obtaining good performance policies and may not require a well-specified reward function; moreover, some methods do not need expert human teachers for training high performance agents [4-6].


META-DES: A Dynamic Ensemble Selection Framework using Meta-Learning

arXiv.org Machine Learning

Dynamic ensemble selection systems work by estimating the level of competence of each classifier from a pool of classifiers. Only the most competent ones are selected to classify a given test sample. This is achieved by defining a criterion to measure the level of competence of a base classifier, such as, its accuracy in local regions of the feature space around the query instance. However, using only one criterion about the behavior of a base classifier is not sufficient to accurately estimate its level of competence. In this paper, we present a novel dynamic ensemble selection framework using meta-learning. We propose five distinct sets of meta-features, each one corresponding to a different criterion to measure the level of competence of a classifier for the classification of input samples. The meta-features are extracted from the training data and used to train a meta-classifier to predict whether or not a base classifier is competent enough to classify an input instance. During the generalization phase, the meta-features are extracted from the query instance and passed down as input to the meta-classifier. The meta-classifier estimates, whether a base classifier is competent enough to be added to the ensemble. Experiments are conducted over several small sample size classification problems, i.e., problems with a high degree of uncertainty due to the lack of training data. Experimental results show the proposed meta-learning framework greatly improves classification accuracy when compared against current state-of-the-art dynamic ensemble selection techniques.