AITopics | Overview

Overview

Policy Distillation and Value Matching in Multiagent Reinforcement Learning

Wadhwania, Samir, Kim, Dong-Ki, Omidshafiei, Shayegan, How, Jonathan P.

arXiv.org Artificial Intelligence, Mar-15-2019

Multiagent reinforcement learning (MARL) algorithms have been demonstrated on complex tasks that require the coordination of a team of multiple agents to complete. Existing works have focused on sharing information between agents via centralized critics to stabilize learning or through communication to increase performance, but do not generally look at how information can be shared between agents to address the curse of dimensionality in MARL. We posit that a multiagent problem can be decomposed into a multi-task problem where each agent explores a subset of the state space instead of exploring the entire state space. This paper introduces a multiagent actor-critic algorithm and a method for combining knowledge from homogeneous agents through distillation and value-matching that outperforms policy distillation alone and allows further learning in both discrete and continuous action spaces.

agent, artificial intelligence, machine learning, (17 more...)

1903.06592

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
North America > United States > New York > New York County > New York City (0.04)

Genre:

Research Report (0.64)
Overview (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

A Review of Reinforcement Learning for Autonomous Building Energy Management

Mason, Karl, Grijalva, Santiago

arXiv.org Machine Learning, Mar-15-2019

The area of building energy management has received a significant amount of interest in recent years. This area is concerned with combining advancements in sensor technologies, communications and advanced control algorithms to optimize energy utilization. Reinforcement learning is one of the most prominent machine learning approaches used for control problems and has had many successful applications in the area of building energy management. This research gives a comprehensive review of the literature relating to the application of reinforcement learning to developing autonomous building energy management systems. The main directions for future research and the challenges in reinforcement learning are also outlined.

artificial intelligence, machine learning, reinforcement learning, (14 more...)

1903.05196

Country:

North America > United States (0.68)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.28)

Genre:

Overview (1.00)
Research Report > Experimental Study (0.34)

Industry:

Transportation > Ground > Road (1.00)
Energy > Power Industry (1.00)
Energy > Renewable > Solar (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.47)

7 Enabling Capabilities To Improve Poor Results From Massive AI

#artificialintelligence, Mar-13-2019, 17:50:38 GMT

A survey of over 1,200 executives has just revealed that despite massive and increasing investments in digital transformation and technologies such as artificial intelligence and big data, companies are struggling to turn those investments into real business results. A survey unveiled today by Deloitte has found that the number of companies investing heavily in digital transformation has almost doubled in the past year. The accounting and services giant questioned 1,200 executives at organizations of at least 500 people with more than $250 million in revenue, finding that 19% planned to invest $20 million or more during 2019. When asked the same question at the start of 2018, 10% gave that answer. The term "digital transformation" has come to mean steps that move an organization towards adopting data-driven business models, typically involving artificial intelligence (AI), big data and predictive analytics technology.

artificial intelligence, big data, data mining, (14 more...)

Genre: Overview (0.36)

Technology:

Information Technology > Artificial Intelligence (1.00)
Information Technology > Data Science > Data Mining > Big Data (0.57)

What is Automation Anywhere tool?

#artificialintelligence, Mar-13-2019, 03:32:06 GMT

Robotic process automation (RPA) is a revolutionary technology that streamlines and automates daily repetitive tasks, thus minimizing errors to almost zero and increasing productivity. Automation Anywhere is a developer of RPA software. RPA is one of the game-changing technologies that change the way an enterprise operates. The Automation Anywhere tool combines RPA solutions with intellectual elements like natural language understanding and reading unstructured data. It allows organizations to automate everyday processes that are performed by staff.

artificial intelligence, bot, control room, (3 more...)

Genre: Overview > Innovation (0.60)

Technology: Information Technology > Artificial Intelligence > Robots (1.00)

AutoML @ NeurIPS 2018 challenge: Design and Results

Escalante, Hugo Jair, Tu, Wei-Wei, Guyon, Isabelle, Silver, Daniel L., Viegas, Evelyne, Chen, Yuqiang, Dai, Wenyuan, Yang, Qiang

arXiv.org Machine Learning, Mar-13-2019

We organized a competition on Autonomous Lifelong Machine Learning with Drift that was part of the competition program of NeurIPS 2018. This data-driven competition asked participants to develop computer programs capable of solving supervised learning problems where the i.i.d. assumption did not hold. Large data sets were arranged in a lifelong learning and evaluation scenario and CodaLab was used as the challenge platform. The challenge attracted more than 300 participants in its two-month duration. This chapter describes the design of the challenge and summarizes its main results.

artificial intelligence, machine learning, participant, (16 more...)

1903.05263

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Asia > China > Beijing > Beijing (0.04)
Asia > China > Hong Kong (0.04)
(13 more...)

Genre:

Overview (0.46)
Research Report (0.40)
Questionnaire & Opinion Survey (0.34)

Industry: Education > Educational Setting > Continuing Education (0.35)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Machine Learning in IoT Security: Current Solutions and Future Challenges

Hussain, Fatima, Hussain, Rasheed, Hassan, Syed Ali, Hossain, Ekram

arXiv.org Machine Learning, Mar-13-2019

The future Internet of Things (IoT) will have a deep economic, commercial and social impact on our lives. The participating nodes in IoT networks are usually resource-constrained, which makes them attractive targets for cyber attacks. In this regard, extensive efforts have been made to address the security and privacy issues in IoT networks, primarily through traditional cryptographic approaches. However, the unique characteristics of IoT nodes render the existing solutions insufficient to encompass the entire security spectrum of the IoT networks. This is, at least in part, because of the resource constraints, heterogeneity, massive real-time data generated by the IoT devices, and the extensively dynamic behavior of the networks. Therefore, Machine Learning (ML) and Deep Learning (DL) techniques, which are able to provide embedded intelligence in the IoT devices and networks, are leveraged to cope with different security problems. In this paper, we systematically review the security requirements, attack vectors, and the current security solutions for the IoT networks. We then shed light on the gaps in these security solutions that call for ML and DL approaches. We also discuss in detail the existing ML and DL solutions for addressing different security problems in IoT networks. Finally, based on the detailed investigation of the existing solutions in the literature, we discuss the future research directions for ML- and DL-based IoT security.

data mining, machine learning, reinforcement learning, (20 more...)

1904.05735

Country: North America > Canada (0.92)

Genre:

Overview (1.00)
Research Report > Promising Solution (0.45)

Industry:

Information Technology > Security & Privacy (1.00)
Government > Military > Cyberwarfare (0.47)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Internet of Things (1.00)
Information Technology > Data Science > Data Mining (1.00)
(5 more...)

Elements of Sequential Monte Carlo

Naesseth, Christian A., Lindsten, Fredrik, Schön, Thomas B.

arXiv.org Machine Learning, Mar-12-2019

A core problem in statistics and probabilistic machine learning is to compute probability distributions and expectations. This is the fundamental problem of Bayesian statistics and machine learning, which frames all inference as expectations with respect to the posterior distribution. The key challenge is to approximate these intractable expectations. In this tutorial, we review sequential Monte Carlo (SMC), a random-sampling-based class of methods for approximate inference. First, we explain the basics of SMC, discuss practical issues, and review theoretical results. We then examine two of the main user design choices: the proposal distributions and the so-called intermediate target distributions. We review recent results on how variational inference and amortization can be used to learn efficient proposals and target distributions. Next, we discuss the SMC estimate of the normalizing constant and how it can be used for pseudo-marginal inference and inference evaluation. Throughout the tutorial we illustrate the use of SMC on various models commonly used in machine learning, such as stochastic recurrent neural networks, probabilistic graphical models, and probabilistic programs.

approximation, artificial intelligence, machine learning, (17 more...)

1903.04797

Country:

Europe > Sweden > Östergötland County > Linköping (0.04)
Asia > Middle East > Jordan (0.04)
North America > United States > New York (0.04)
(6 more...)

Genre:

Research Report (0.81)
Overview (0.67)
Instructional Material > Course Syllabus & Notes (0.45)

Business leaders love artificial intelligence - but only in theory

#artificialintelligence, Mar-11-2019, 12:53:54 GMT

Microsoft has unveiled the results of a survey of business leaders on the topic of artificial intelligence. The findings are surprising: German and Russian entrepreneurs and executives appear to come out ahead of those from the US and other advanced European economies when it comes to adopting the technology. Mostly, however, this and several other studies confirm a frustrating problem: The AI hype is making it impossible to figure out how much businesses really need it and are using it. The 800 respondents in the study came from seven countries – the US, Germany, France, the UK, Italy, the Netherlands and Switzerland. It's not a globe-spanning dataset and it doesn't include the potential AI leader, China, or one of the leaders in AI research, Canada.

artificial intelligence, business leader love artificial intelligence, implementation, (2 more...)

Country:

Europe (1.00)
North America > Canada > Ontario > Toronto (0.16)

Genre: Overview (0.36)

Industry: Banking & Finance (0.31)

Technology: Information Technology > Artificial Intelligence (1.00)

Deep learning for molecular generation and optimization - a review of the state of the art

Elton, Daniel C., Boukouvalas, Zois, Fuge, Mark D., Chung, Peter W.

arXiv.org Machine Learning, Mar-11-2019

In the space of only a few years, deep generative modeling has revolutionized how we think of artificial creativity, yielding autonomous systems which produce original images, music, and text. Inspired by these successes, researchers are now applying deep generative modeling techniques to the generation and optimization of molecules; in our review we found 45 papers on the subject published in the past two years. These works point to a future where such systems will be used to generate lead molecules, greatly reducing resources spent downstream synthesizing and characterizing bad leads in the lab. In this review we survey the increasingly complex landscape of models and representation schemes that have been proposed. The four classes of techniques we describe are recursive neural networks, autoencoders, generative adversarial networks, and reinforcement learning. After first discussing some of the mathematical fundamentals of each technique, we draw high-level connections and comparisons with other techniques and expose the pros and cons of each. Several important high-level themes emerge as a result of this work, including the shift away from the SMILES string representation of molecules towards more sophisticated representations such as graph grammars and 3D representations, the importance of reward function design, the need for better standards for benchmarking and testing, and the benefits of adversarial training and reinforcement learning over maximum-likelihood-based training.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

1903.04388

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > Maryland > Prince George's County > College Park (0.04)
Europe > France (0.04)
(5 more...)

Genre:

Research Report (1.00)
Overview (0.87)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Energy > Renewable (0.67)
Materials (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.34)

A Survey on Transfer Learning for Multiagent Reinforcement Learning Systems

Silva, Felipe Leno Da, Costa, Anna Helena Reali

Journal of Artificial Intelligence Research, Mar-11-2019

Multiagent Reinforcement Learning (RL) solves complex tasks that require coordination with other agents through autonomous exploration of the environment. However, learning a complex task from scratch is impractical due to the huge sample complexity of RL algorithms. For this reason, reusing knowledge that can come from previous experience or other agents is indispensable to scale up multiagent RL algorithms. This survey provides a unifying view of the literature on knowledge reuse in multiagent RL. We define a taxonomy of solutions for the general knowledge reuse problem, providing a comprehensive discussion of recent progress on knowledge reuse in Multiagent Systems (MAS) and of techniques for knowledge reuse across agents (that may be actuating in a shared environment or not). We aim at encouraging the community to work towards reusing all the knowledge sources available in a MAS. For that, we provide an in-depth discussion of current lines of research and open questions.

agent, knowledge, learning, (13 more...)

doi: 10.1613/jair.1.11396

AI Access Foundation

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > Texas > Travis County > Austin (0.14)
South America > Brazil > São Paulo (0.04)
(3 more...)

Genre:

Research Report (1.00)
Overview (1.00)

Industry:

Leisure & Entertainment > Games (1.00)
Education (1.00)
Leisure & Entertainment > Sports > Soccer (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.46)
