Goto

Collaborating Authors

 Accuracy


Towards personalized human AI interaction - adapting the behavior of AI agents using neural signatures of subjective interest

arXiv.org Machine Learning

The use of Artificial Neural Networks (ANNs) towards developing Artificial Intelligence (AI) has undergone a renaissance in the past decade. Out of the many emergent techniques for training ANNs that are collectively referred to as'Deep Learning', Deep Reinforcement Learning (DRL) is proving to be a particularly general and powerful method, with applications ranging from video games [1] to autonomous driving [2]. While most applications of reinforcement learning have traditionally used reinforcement signals derived from performance measures that are explicit to the task - e.g. the score in a game or grammatical errors in a translation, when considering AI systems that are required to have a significant interaction with humans - e.g. the autonomous vehicle - it is critical to consider how the human's preference for objects, events, or actions can be incorporated into the behavioral reinforcement for the AI, particularly in ways that are minimally obtrusive [3], [4]. Such behavioral adaptations occur naturally during social interactions and form the bedrock of social mechanisms that build trust and rapport between strangers [5], [6]. In this paper, we present a novel approach that uses decoded human neurophysiological and ocular time-series data as an implicit reinforcement signal for an AI agent that is driving a virtual automobile.


Network cross-validation by edge sampling

arXiv.org Machine Learning

Statistical methods for network data have received a lot of attention because of the wideranging applications of network analysis. There is now a large body of work on methods and models for networks, including the stochastic block model (SBM) [Holland et al., 1983], the degree-corrected stochastic block model (DCSBM) [Karrer and Newman, 2011], and the latent space model [Hoff et al., 2002], to name a few. While this gives the practitioner plenty of choices, there is a lot less work on the crucial question of how to select the best model for the data, as well as how to choose tuning parameters for the selected model, which is often necessary in order to fit it. In some specific problems, progress has been made recently, for instance, in the much-studied problem of community detection. Community detection is the problem of clustering network nodes into groups, and most of the methods proposed over the last twenty years or so require the number of communities K as input.


AI in AML: Present tensed, but future perfect

#artificialintelligence

Today, Financial Institutions (FIs) face significant legal and reputational risks when it comes to complying with anti-money laundering (AML) requirements (including anti-terrorist financing and obligations to conform). Failure can lead to serious sanctions imposed by regulatory bodies (Recently, Societe Generale fined $5.83 MM for a number of shortcomings in its control for preventing money laundering). Today's financial markets are truly global. Transactions and flow of funds take place through a web of interactions across nations and systems. This makes it difficult to be compliant with thousands of regulations and norms across a large number of jurisdictions.


Weighted Message Passing and Minimum Energy Flow for Heterogeneous Stochastic Block Models with Side Information

arXiv.org Machine Learning

We study the misclassification error for community detection in general heterogeneous stochastic block models (SBM) with noisy or partial label information. We establish a connection between the misclassification rate and the notion of minimum energy on the local neighborhood of the SBM. We develop an optimally weighted message passing algorithm to reconstruct labels for SBM based on the minimum energy flow and the eigenvectors of a certain Markov transition matrix. The general SBM considered in this paper allows for unequal-size communities, degree heterogeneity, and different connection probabilities among blocks. We focus on how to optimally weigh the message passing to improve misclassification.


Comparative Benchmarking of Causal Discovery Techniques

arXiv.org Machine Learning

In this paper we present a comprehensive view of prominent causal discovery algorithms, categorized into two main categories (1) assuming acyclic and no latent variables, and (2) allowing both cycles and latent variables, along with experimental results comparing them from three perspectives: (a) structural accuracy, (b) standard predictive accuracy, and (c) accuracy of counterfactual inference. For (b) and (c) we train causal Bayesian networks with structures as predicted by each causal discovery technique to carry out counterfactual or standard predictive inference. We compare causal algorithms on two pub- licly available and one simulated datasets having different sample sizes: small, medium and large. Experiments show that structural accuracy of a technique does not necessarily correlate with higher accuracy of inferencing tasks. Fur- ther, surveyed structure learning algorithms do not perform well in terms of structural accuracy in case of datasets having large number of variables.


The impossibility of "fairness": a generalized impossibility result for decisions

arXiv.org Machine Learning

Various measures can be used to estimate bias or unfairness in a predictor. Previous work has already established that some of these measures are incompatible with each other. Here we show that, when groups differ in prevalence of the predicted event, several intuitive, reasonable measures of fairness (probability of positive prediction given occurrence or non-occurrence; probability of occurrence given prediction or non-prediction; and ratio of predictions over occurrences for each group) are all mutually exclusive: if one of them is equal among groups, the other two must differ. The only exceptions are for perfect, or trivial (always-positive or always-negative) predictors. As a consequence, any non-perfect, non-trivial predictor must necessarily be "unfair" under two out of three reasonable sets of criteria. This result readily generalizes to a wide range of well-known statistical quantities (sensitivity, specificity, false positive rate, precision, etc.), all of which can be divided into three mutually exclusive groups. Importantly, The results applies to all predictors, whether algorithmic or human. We conclude with possible ways to handle this effect when assessing and designing prediction methods.


Reservoir of Diverse Adaptive Learners and Stacking Fast Hoeffding Drift Detection Methods for Evolving Data Streams

arXiv.org Machine Learning

The last decade has seen a surge of interest in adaptive learning algorithms for data stream classification, with applications ranging from predicting ozone level peaks, learning stock market indicators, to detecting computer security violations. In addition, a number of methods have been developed to detect concept drifts in these streams. Consider a scenario where we have a number of classifiers with diverse learning styles and different drift detectors. Intuitively, the current 'best' (classifier, detector) pair is application dependent and may change as a result of the stream evolution. Our research builds on this observation. We introduce the $\mbox{Tornado}$ framework that implements a reservoir of diverse classifiers, together with a variety of drift detection algorithms. In our framework, all (classifier, detector) pairs proceed, in parallel, to construct models against the evolving data streams. At any point in time, we select the pair which currently yields the best performance. We further incorporate two novel stacking-based drift detection methods, namely the $\mbox{FHDDMS}$ and $\mbox{FHDDMS}_{add}$ approaches. The experimental evaluation confirms that the current 'best' (classifier, detector) pair is not only heavily dependent on the characteristics of the stream, but also that this selection evolves as the stream flows. Further, our $\mbox{FHDDMS}$ variants detect concept drifts accurately in a timely fashion while outperforming the state-of-the-art.


Feature selection in high-dimensional dataset using MapReduce

arXiv.org Machine Learning

The exponential growth of data generation, measurements and collection in scientific and engineering disciplines leads to the availability of huge and high-dimensional datasets, in domains as varied as text mining, social network, astronomy or bioinformatics to name a few. The only viable path to the analysis of such datasets is to rely on data-intensive distributed computing frameworks [1]. MapReduce has in the last decade established itself as a reference programming model for distributed computing. The model is articulated around two main classes of functions, mappers and reducers, which greatly decrease the complexity of a distributed program while allowing to express a wide range of computing tasks. MapReduce was popularised by Google research in 2008 [2], and may be executed on parallel computing platforms ranging from specialised hardware units such as parallel field programmable gate arrays (FPGAs) and graphics processing units, to large clusters of commodity machine using for example the Hadoop or Spark frameworks [2]-[4]. In particular, the expressiveness of the MapReduce programming model has led to the design of advanced distributed data processing libraries for machine learning and data mining, such as Hadoop Mahout and Spark MLlib. Many of the standard supervised and unsupervised learning techniques (linear and logistic regression, naive Bayes, SVM, random forest, PCA) are now available from these libraries [5]-[7]. Little attention has however yet been given to feature selection algorithms (FSA), which form an essential component of machine learning and data mining workflows. Besides reducing a dataset size, FSA also generally allow to improve the performance of classification and regression models by selecting the most relevant features and reducing the noise in a dataset [8].


Dealing With Imbalanced Datasets

@machinelearnbot

Summary: Dealing with imbalanced datasets is an everyday problem. SMOTE, Synthetic Minority Oversampling TEchnique and its variants are techniques for solving this problem through oversampling that have recently become a very popular way to improve model performance. There are some problems that never go away. Imbalanced datasets is one in which the majority case greatly outweighs the minority case. Years ago we dealt with this by naïve oversampling or, if we had enough data, even under sampling to get the dataset more in balance.


Blockchains for Artificial Intelligence – The BigchainDB Blog

#artificialintelligence

And, it was first published on Dataconomy on Dec 21, 2016; I'm reposting here for ease of access. In May 2017 I gave an updated talk; here's the slides & video.] In recent years, AI (artificial intelligence) researchers have finally cracked problems that they've worked on for decades, from Go to human-level speech recognition. A key piece was the ability to gather and learn on mountains of data, which pulled error rates past the success line. In short, big data has transformed AI, to an almost unreasonable level. Blockchain technology could transform AI too, in its own particular ways. Some applications of blockchains to AI are mundane, like audit trails on AI models. Some appear almost unreasonable, like AI that can own itself -- AI DAOs. All of them are opportunities. This article will explore these applications. Before we discuss applications, let's first review what's different about blockchains compared to traditional big-data distributed databases like MongoDB.