Goto

Collaborating Authors

 Government


Client Profiling for an Anti-Money Laundering System

arXiv.org Artificial Intelligence

Acts of prevention and fight against money laundering (ML) crimes are prioritized by almost every government in the world, at the same level of the most relevant global issues. Money laundering is a crime that typically consists in making a certain illegal financial gain into a legal gain. According to the United Nations Office on Drugs and Crimes (UNODC) the annual global estimate of laundered money is about 2% - 5% of the Gross World Product, or US$800 billion - US$2 trillion [1]. As if the financial volume were not enough, another reason for governments to focus on this crime is for the fact that it is clearly connected to other types of crimes such as illegal drug trade, fraud, corruption, kidnapping, terrorism, arms smuggling, among others. Most countries' financial authorities, usually Central Banks, are responsible for controlling and defining antimoney laundering (AML) regulations, demanding from financial institutions the implementation of procedures that apply the defined norms.


Consistent Biclustering

arXiv.org Machine Learning

Biclustering, the process of simultaneously clustering the rows and columns of a data matrix, is a popular and effective tool for finding structure in a high-dimensional dataset. Many biclustering procedures appear to work well in practice, but most do not have associated consistency guarantees. To address this shortcoming, we propose a new biclustering procedure based on profile likelihood. The procedure applies to a broad range of data modalities, including binary, count, and continuous observations. We prove that the procedure recovers the true row and column classes when the dimensions of the data matrix tend to infinity, even if the functional form of the data distribution is misspecified. The procedure requires computing a combinatorial search, which can be expensive in practice. Rather than performing this search directly, we propose a new heuristic optimization procedure based on the Kernighan-Lin heuristic, which has nice computational properties and performs well in simulations. We demonstrate our procedure with applications to congressional voting records, and microarray analysis.


Probabilistic Programming with Gaussian Process Memoization

arXiv.org Machine Learning

Gaussian Processes (GPs) are widely used tools in statistics, machine learning, robotics, computer vision, and scientific computation. However, despite their popularity, they can be difficult to apply; all but the simplest classification or regression applications require specification and inference over complex covariance functions that do not admit simple analytical posteriors. This paper shows how to embed Gaussian processes in any higher-order probabilistic programming language, using an idiom based on memoization, and demonstrates its utility by implementing and extending classic and state-of-the-art GP applications. The interface to Gaussian processes, called gpmem, takes an arbitrary real-valued computational process as input and returns a statistical emulator that automatically improve as the original process is invoked and its input-output behavior is recorded. The flexibility of gpmem is illustrated via three applications: (i) robust GP regression with hierarchical hyper-parameter learning, (ii) discovering symbolic expressions from time-series data by fully Bayesian structure learning over kernels generated by a stochastic grammar, and (iii) a bandit formulation of Bayesian optimization with automatic inference and action selection. All applications share a single 50-line Python library and require fewer than 20 lines of probabilistic code each.


Human-Centered Cognitive Orthoses: Artificial Intelligence for, Rather than Instead of, the People

AI Magazine

This issue of AI Magazine includes six articles on cognitive orthoses, which we broadly conceive as technological approaches that amplify or enhance individual or team cognition across a wide range of goals and activities. The articles are grouped by how they relate to orthoses enhanced socio-technical team intelligence at three different cognitive levels—sensorimotor physical, professional learning, and networked knowledge.


Efficient Learning by Directed Acyclic Graph For Resource Constrained Prediction

Neural Information Processing Systems

We study the problem of reducing test-time acquisition costs in classification systems. Our goal is to learn decision rules that adaptively select sensors for each example as necessary to make a confident prediction. We model our system as a directed acyclic graph (DAG) where internal nodes correspond to sensor subsets and decision functions at each node choose whether to acquire a new sensor or classify using the available measurements. This problem can be naturally posed as an empirical risk minimization over training data. Rather than jointly optimizing such a highly coupled and non-convex problem over all decision nodes, we propose an efficient algorithm motivated by dynamic programming. We learn node policies in the DAG by reducing the global objective to a series of cost sensitive learning problems. Our approach is computationally efficient and has proven guarantees of convergence to the optimal system for a fixed architecture. In addition, we present an extension to map other budgeted learning problems with large number of sensors to our DAG architecture and demonstrate empirical performance exceeding state-of-the-art algorithms for data composed of both few and many sensors.


Backpropagation for Energy-Efficient Neuromorphic Computing

Neural Information Processing Systems

Solving real world problems with embedded neural networks requires both training algorithms that achieve high performance and compatible hardware that runs in real time while remaining energy efficient. For the former, deep learning using backpropagation has recently achieved a string of successes across many domains and datasets. For the latter, neuromorphic chips that run spiking neural networks have recently achieved unprecedented energy efficiency. To bring these two advances together, we must first resolve the incompatibility between backpropagation, which uses continuous-output neurons and synaptic weights, and neuromorphic designs, which employ spiking neurons and discrete synapses. Our approach is to treat spikes and discrete synapses as continuous probabilities, which allows training the network using standard backpropagation. The trained network naturally maps to neuromorphic hardware by sampling the probabilities to create one or more networks, which are merged using ensemble averaging. To demonstrate, we trained a sparsely connected network that runs on the TrueNorth chip using the MNIST dataset. With a high performance network (ensemble of $64$), we achieve $99.42\%$ accuracy at $121 \mu$J per image, and with a high efficiency network (ensemble of $1$) we achieve $92.7\%$ accuracy at $0.408 \mu$J per image.


Research Priorities for Robust and Beneficial Artificial Intelligence

AI Magazine

Success in the quest for artificial intelligence has the potential to bring unprecedented benefits to humanity, and it is therefore worthwhile to investigate how to maximize these benefits while avoiding potential pitfalls. This article gives numerous examples (which should by no means be construed as an exhaustive list) of such worthwhile research aimed at ensuring that AI remains robust and beneficial.


Cognition as a Service: An Industry Perspective

AI Magazine

Recent advances in cognitive computing componentry combined with other factors are leading to commercially viable cognitive systems. From chips to smart phones to public and private clouds, industrial strength “cognition as a service” is beginning to appear at all scales in business and society. Furthermore, in the age of zettabytes on the way to yottabytes, the designers, engineers, and managers of future smart systems will depend on cognition as a service. Cognition as a service can help unlock the mysteries of big data and ultimately boost the creativity and productivity of professionals and their teams, the productive output of industries and organizations, as well as the GDP (gross domestic product) of regions and nations. In this and the next decade, cognition as a service will allow us to re-image work practices, augmenting and scaling expertise to transform professions, industries, and regions.


Extending the Diagnostic Capabilities of Artificial Intelligence-Based Instructional Systems

AI Magazine

Active problem solving has been shown to be one of the most effective ways to acquire complex skills. Whether one is learning a programming language by implementing a computer program, or learning calculus by solving problems, context sensitive feedback and guidance are crucial to keeping problem solving efforts fruitful and efficient. This article reviews AI-based algorithms that can diagnose student difficulties during active problem solving and serve as the basis for providing context-sensitive and individualized guidance. The article also describes the crucial role sensor based estimates of cognitive resources such as working memory capacity and attention can play in enhancing the diagnostic capabilities of intelligent instructional systems.


Human-Centered Design of Wearable Neuroprostheses and Exoskeletons

AI Magazine

Human-centered design of wearable robots involves the development of innovative science and technologies that minimize the mismatch between humans’ and machines’ capabilities, leading to their intuitive integration and confluent interaction. Here, we summarize our human-centered approach to the design of closed-loop brain-machine interfaces (BMI) to powered prostheses and exoskeletons that allow people to act beyond their impaired or diminished physical or sensory-motor capabilities. The goal is to develop multifunctional human-machine interfaces with integrated diagnostic, assistive and therapeutic functions. Moreover, these complex human-machine systems should be effective, reliable, safe and engaging and support the patient in performing intended actions with minimal effort and errors with adequate interaction time. To illustrate our approach, we review an example of a user-in-the-loop, patient-centered, non-invasive BMI system to a powered exoskeleton for persons with paraplegia. We conclude with a summary of challenges to the translation of these complex human-machine systems to the end-user.