Overview
Theoretical Impediments to Machine Learning With Seven Sparks from the Causal Revolution
Current machine learning systems operate, almost exclusively, in a statistical, or model-free mode, which entails severe theoretical limits on their power and performance. Such systems cannot reason about interventions and retrospection and, therefore, cannot serve as the basis for strong AI. To achieve human level intelligence, learning machines need the guidance of a model of reality, similar to the ones used in causal inference tasks. To demonstrate the essential role of such models, I will present a summary of seven tasks which are beyond reach of current machine learning systems and which have been accomplished using the tools of causal modeling.
Distributed Constraint Optimization Problems and Applications: A Survey
Fioretto, Ferdinando, Pontelli, Enrico, Yeoh, William
The field of Multi-Agent System (MAS) is an active area of research within Artificial Intelligence, with an increasingly important impact in industrial and other real-world applications. Within a MAS, autonomous agents interact to pursue personal interests and/or to achieve common objectives. Distributed Constraint Optimization Problems (DCOPs) have emerged as one of the prominent agent architectures to govern the agents' autonomous behavior, where both algorithms and communication models are driven by the structure of the specific problem. During the last decade, several extensions to the DCOP model have enabled them to support MAS in complex, real-time, and uncertain environments. This survey aims at providing an overview of the DCOP model, giving a classification of its multiple extensions and addressing both resolution methods and applications that find a natural mapping within each class of DCOPs. The proposed classification suggests several future perspectives for DCOP extensions, and identifies challenges in the design of efficient resolution algorithms, possibly through the adaptation of strategies from different areas.
Real World Machine Learning Challenges - CMIH - The Centre for Mathematical Imaging in Healthcare
Machine learning solutions, in particular those based on deep learning methods, form an underpinning of the current revolution in "artificial intelligence" that has dominated popular press headlines and is having a significant influence on the wider tech agenda. In this talk I will give an overview of where we are now with machine learning solutions, and what challenges we face both in the near and far future. These include practical application of existing algorithms in the face of the need to explain decision making, mechanisms for improving the quality and availability of data, dealing with large unstructured datasets.
Netvue Releases World's First Artificial Intelligence Doorbell
It offers a leading-edge approach to respond to visitors and surroundings of the house. Belle will be available on Kickstarter on Jan. 16, 2018, starting at $129, as well as on exhibit at #42925, Sands Hall at CES 2018. Powered by an HD live camera and two-way audio, Belle functions as a 24/7 on-shift liaison between the house owners and their visitors. Through Netvue's mobile app, the real-time view in front of the home entrance is available with a network connection and communicates with guests on a constant basis even when the house owner is away from home. With the infrared night vision, video monitoring in the dark is no longer a blurry sight.
AI for the Masses: How CIOs Can Prepare for Machine Learning's Wide-Ranging Business Impact
Whether we're ready for it or not, artificial intelligence (AI) is infiltrating the enterprise. From natural language processing systems that manage customer service inquiries to automated manufacturing plants staffed by robots, AI technologies, driven by machine learning, are having an impact. And with the accelerated rate of innovation--brought on by exponential increases in computer processing power and the sheer volume of data creation--AI clearly has the potential to transform just about every industry, from aerospace to retail. In a Harvard Business Review article, MIT researchers Erik Brynjolfsson and Andrew McAfee labeled AI "the most important general-purpose technology of our era." The effects of AI will be magnified in the coming decade, they say, as organizations "transform their core processes and business models to take advantage of machine learning."
Character-level Recurrent Neural Networks in Practice: Comparing Training and Sampling Schemes
De Boom, Cedric, Demeester, Thomas, Dhoedt, Bart
Recurrent neural networks are nowadays successfully used in an abundance of applications, going from text, speech and image processing to recommender systems. Backpropagation through time is the algorithm that is commonly used to train these networks on specific tasks. Many deep learning frameworks have their own implementation of training and sampling procedures for recurrent neural networks, while there are in fact multiple other possibilities to choose from and other parameters to tune. In existing literature this is very often overlooked or ignored. In this paper we therefore give an overview of possible training and sampling schemes for character-level recurrent neural networks to solve the task of predicting the next token in a given sequence. We test these different schemes on a variety of datasets, neural network architectures and parameter settings, and formulate a number of take-home recommendations. The choice of training and sampling scheme turns out to be subject to a number of trade-offs, such as training stability, sampling time, model performance and implementation effort, but is largely independent of the data. Perhaps the most surprising result is that transferring hidden states for correctly initializing the model on subsequences often leads to unstable training behavior depending on the dataset.
Quantum Machine Learning: An Overview
At a recent conference in 2017, Microsoft CEO Satya Nadella used the analogy of a corn maze to explain the difference in approach between a classical computer and a quantum computer. In trying to find a path through the maze, a classical computer would start down a path, hit an obstruction, backtrack; start again, hit another obstruction, backtrack again until it ran out of options. Although an answer can be found, this approach could be a very time-consuming. They take every path in the corn maze simultaneously." Thus, leading to an exponential reduction in the number of steps required to solve a problem.
Applications of Deep Learning and Reinforcement Learning to Biological Data
Mahmud, Mufti, Kaiser, M. Shamim, Hussain, Amir, Vassanelli, Stefano
Rapid advances of hardware-based technologies during the past decades have opened up new possibilities for Life scientists to gather multimodal data in various application domains (e.g., Omics, Bioimaging, Medical Imaging, and [Brain/Body]-Machine Interfaces), thus generating novel opportunities for development of dedicated data intensive machine learning techniques. Overall, recent research in Deep learning (DL), Reinforcement learning (RL), and their combination (Deep RL) promise to revolutionize Artificial Intelligence. The growth in computational power accompanied by faster and increased data storage and declining computing costs have already allowed scientists in various fields to apply these techniques on datasets that were previously intractable for their size and complexity. This review article provides a comprehensive survey on the application of DL, RL, and Deep RL techniques in mining Biological data. In addition, we compare performances of DL techniques when applied to different datasets across various application domains. Finally, we outline open issues in this challenging research area and discuss future development perspectives.
Batched High-dimensional Bayesian Optimization via Structural Kernel Learning
Wang, Zi, Li, Chengtao, Jegelka, Stefanie, Kohli, Pushmeet
Optimization of high-dimensional black-box functions is an extremely challenging problem. While Bayesian optimization has emerged as a popular approach for optimizing black-box functions, its applicability has been limited to low-dimensional problems due to its computational and statistical challenges arising from high-dimensional settings. In this paper, we propose to tackle these challenges by (1) assuming a latent additive structure in the function and inferring it properly for more efficient and effective BO, and (2) performing multiple evaluations in parallel to reduce the number of iterations required by the method. Our novel approach learns the latent structure with Gibbs sampling and constructs batched queries using determinantal point processes. Experimental validations on both synthetic and real-world functions demonstrate that the proposed method outperforms the existing state-of-the-art approaches.
A Predictive Approach Using Deep Feature Learning for Electronic Medical Records: A Comparative Study
Nezhad, Milad Zafar, Zhu, Dongxiao, Sadati, Najibesadat, Yang, Kai
Massive amount of electronic medical records accumulating from patients and populations motivates clinicians and data scientists to collaborate for the advanced analytics to extract knowledge that is essential to address the extensive personalized insights needed for patients, clinicians, providers, scientists, and health policy makers. In this paper, we propose a new predictive approach based on feature representation using deep feature learning and word embedding techniques. Our method uses different deep architectures for feature representation in higher-level abstraction to obtain effective and more robust features from EMRs, and then build prediction models on the top of them. Our approach is particularly useful when the unlabeled data is abundant whereas labeled one is scarce. We investigate the performance of representation learning through a supervised approach. First, we apply our method on a small dataset related to a specific precision medicine problem, which focuses on prediction of left ventricular mass indexed to body surface area (LVMI) as an indicator of heart damage risk in a vulnerable demographic subgroup (African-Americans). Then we use two large datasets from eICU collaborative research database to predict the length of stay in Cardiac-ICU and Neuro-ICU based on high dimensional features. Finally we provide a comparative study and show that our predictive approach leads to better results in comparison with others.