Goto

Collaborating Authors

 Overview


Policy Distillation and Value Matching in Multiagent Reinforcement Learning

arXiv.org Artificial Intelligence

Multiagent reinforcement learning algorithms (MARL) have been demonstrated on complex tasks that require the coordination of a team of multiple agents to complete. Existing works have focused on sharing information between agents via centralized critics to stabilize learning or through communication to increase performance, but do not generally look at how information can be shared between agents to address the curse of dimensionality in MARL. We posit that a multiagent problem can be decomposed into a multi-task problem where each agent explores a subset of the state space instead of exploring the entire state space. This paper introduces a multiagent actor-critic algorithm and method for combining knowledge from homogeneous agents through distillation and value-matching that outperforms policy distillation alone and allows further learning in both discrete and continuous action spaces.


A Review of Reinforcement Learning for Autonomous Building Energy Management

arXiv.org Machine Learning

The area of building energy management has received a significant amount of interest in recent years. This area is concerned with combining advancements in sensor technologies, communications and advanced control algorithms to optimize energy utilization. Reinforcement learning is one of the most prominent machine learning algorithms used for control problems and has had many successful applications in the area of building energy management. This research gives a comprehensive review of the literature relating to the application of reinforcement learning to developing autonomous building energy management systems. The main direction for future research and challenges in reinforcement learning are also outlined.


7 Enabling Capabilities To Improve Poor Results From Massive AI

#artificialintelligence

A survey of over 1,200 executives has just revealed that despite massive and increasing investments in digital transformation and technologies such as artificial intelligence and big data, companies are struggling to turn those investments into real business results. A survey unveiled today by Deloitte has found that the number of companies investing heavily in digital transformation has almost doubled in the past year. The accounting and services giant questioned 1,200 executives at organizations of at least 500 people with above $250 million in revenue, finding that 19% planned to invest $20 million or more during 2019. When asked the same question at the start of 2018, 10% gave that answer. Despite Massive Investments In AI And Digital Transformation, Survey Finds Poor Results And 7 Enabling Capabilities The term "digital transformation" has come to mean steps that move an organization towards adopting data-driven business models, typically involving artificial intelligence (AI), big data and predictive analytics technology.


What is Automation Anywhere tool?

#artificialintelligence

Robotic Process Automation is a revolutionary technology that streamlines and automates daily repetitive tasks, thus, minimizing errors to almost zero and increasing productivity to a new level. Automation Anywhere is a developer of robotic process automation (RPA) software. It is one of the game-changing technologies that changes the way an enterprise operates. Automation Anywhere tool combines robotic process automation solutions with intellectual elements like natural language understanding and reading unstructured data. Automation Anywhere allows organizations to automate everyday processes which are performed by the staff.


AutoML @ NeurIPS 2018 challenge: Design and Results

arXiv.org Machine Learning

We organized a competition on Autonomous Lifelong Machine Learning with Drift that was part of the competition program of NeurIPS 2018. This data driven competition asked participants to develop computer programs capable of solving supervised learning problems where the i.i.d. assumption did not hold. Large data sets were arranged in a lifelong learning and evaluation scenario and CodaLab was used as the challenge platform. The challenge attracted more than 300 participants in its two month duration. This chapter describes the design of the challenge and summarizes its main results.


Machine Learning in IoT Security: Current Solutions and Future Challenges

arXiv.org Machine Learning

The future Internet of Things (IoT) will have a deep economical, commercial and social impact on our lives. The participating nodes in IoT networks are usually resource-constrained, which makes them luring targets for cyber attacks. In this regard, extensive efforts have been made to address the security and privacy issues in IoT networks primarily through traditional cryptographic approaches. However, the unique characteristics of IoT nodes render the existing solutions insufficient to encompass the entire security spectrum of the IoT networks. This is, at least in part, because of the resource constraints, heterogeneity, massive real-time data generated by the IoT devices, and the extensively dynamic behavior of the networks. Therefore, Machine Learning (ML) and Deep Learning (DL) techniques, which are able to provide embedded intelligence in the IoT devices and networks, are leveraged to cope with different security problems. In this paper, we systematically review the security requirements, attack vectors, and the current security solutions for the IoT networks. We then shed light on the gaps in these security solutions that call for ML and DL approaches. We also discuss in detail the existing ML and DL solutions for addressing different security problems in IoT networks. At last, based on the detailed investigation of the existing solutions in the literature, we discuss the future research directions for ML- and DL-based IoT security.


Elements of Sequential Monte Carlo

arXiv.org Machine Learning

A core problem in statistics and probabilistic machine learning is to compute probability distributions and expectations. This is the fundamental problem of Bayesian statistics and machine learning, which frames all inference as expectations with respect to the posterior distribution. The key challenge is to approximate these intractable expectations. In this tutorial, we review sequential Monte Carlo (SMC), a random-sampling-based class of methods for approximate inference. First, we explain the basics of SMC, discuss practical issues, and review theoretical results. We then examine two of the main user design choices: the proposal distributions and the so called intermediate target distributions. We review recent results on how variational inference and amortization can be used to learn efficient proposals and target distributions. Next, we discuss the SMC estimate of the normalizing constant, how this can be used for pseudo-marginal inference and inference evaluation. Throughout the tutorial we illustrate the use of SMC on various models commonly used in machine learning, such as stochastic recurrent neural networks, probabilistic graphical models, and probabilistic programs.


Business leaders love artificial intelligence - but only in theory

#artificialintelligence

Microsoft has unveiled the results of a survey of business leaders on the topic of artificial intelligence. The findings are surprising: German and Russian entrepreneurs and executives appear to come out ahead of those from the US and other advanced European economies when it comes to adopting the technology. Mostly, however, this and several other studies confirm a frustrating problem: The AI hype is making it impossible to figure out how much businesses really need it and are using it. The 800 respondents in the study came from seven countries – the US, Germany, France, the UK, Italy, the Netherlands and Switzerland. It's not a globe-spanning dataset and it doesn't include the potential AI leader, China, or one of the leaders in AI research, Canada.


Deep learning for molecular generation and optimization - a review of the state of the art

arXiv.org Machine Learning

In the space of only a few years, deep generative modeling has revolutionized how we think of artificial creativity, yielding autonomous systems which produce original images, music, and text. Inspired by these successes, researchers are now applying deep generative modeling techniques to the generation and optimization of molecules - in our review we found 45 papers on the subject published in the past two years. These works point to a future where such systems will be used to generate lead molecules, greatly reducing resources spent downstream synthesizing and characterizing bad leads in the lab. In this review we survey the increasingly complex landscape of models and representation schemes that have been proposed. The four classes of techniques we describe are recursive neural networks, autoencoders, generative adversarial networks, and reinforcement learning. After first discussing some of the mathematical fundamentals of each technique, we draw high level connections and comparisons with other techniques and expose the pros and cons of each. Several important high level themes emerge as a result of this work, including the shift away from the SMILES string representation of molecules towards more sophisticated representations such as graph grammars and 3D representations, the importance of reward function design, the need for better standards for benchmarking and testing, and the benefits of adversarial training and reinforcement learning over maximum likelihood based training.


A Survey on Transfer Learning for Multiagent Reinforcement Learning Systems

Journal of Artificial Intelligence Research

Multiagent Reinforcement Learning (RL) solves complex tasks that require coordination with other agents through autonomous exploration of the environment. However, learning a complex task from scratch is impractical due to the huge sample complexity of RL algorithms. For this reason, reusing knowledge that can come from previous experience or other agents is indispensable to scale up multiagent RL algorithms. This survey provides a unifying view of the literature on knowledge reuse in multiagent RL. We define a taxonomy of solutions for the general knowledge reuse problem, providing a comprehensive discussion of recent progress on knowledge reuse in Multiagent Systems (MAS) and of techniques for knowledge reuse across agents (that may be actuating in a shared environment or not). We aim at encouraging the community to work towards reusing all the knowledge sources available in a MAS. For that, we provide an in-depth discussion of current lines of research and open questions.