AITopics | Reinforcement Learning

Reinforcement Learning

"Reinforcement learning is learning what to do – how to map situations to actions – so as to maximize a numerical reward signal. The learner is not told which actions to take, as in most forms of machine learning, but instead must discover which actions yield the most reward by trying them."
– Sutton, Richard S. and Andrew G. Barto. Reinforcement Learning: An Introduction. (1.1). MIT Press, Cambridge, MA, 1998.
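
The trial-and-error idea in the quotation can be illustrated with a toy example. The sketch below is not from Sutton and Barto's text as quoted here; it assumes a hypothetical three-armed bandit with made-up payout probabilities and an epsilon-greedy learner (the function name `run_bandit` and all parameter values are illustrative). The learner is never told which arm is best and only sees the rewards of the arms it tries.

```python
import random


def run_bandit(true_means, steps=2000, epsilon=0.1, seed=0):
    """Epsilon-greedy learner on a Bernoulli bandit (illustrative sketch).

    true_means: hypothetical payout probability of each arm, hidden from the learner.
    Returns the learner's reward estimates, pull counts, and total reward.
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms       # how often each arm was tried
    estimates = [0.0] * n_arms  # running average reward per arm
    total_reward = 0.0

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: try a random arm.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm that currently looks best.
            arm = max(range(n_arms), key=lambda a: estimates[a])

        # The environment returns a numerical reward signal (0 or 1).
        reward = 1.0 if rng.random() < true_means[arm] else 0.0

        # Incremental update of the running average for the chosen arm.
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward

    return estimates, counts, total_reward


if __name__ == "__main__":
    estimates, counts, total = run_bandit([0.2, 0.5, 0.8])
    print("estimated values:", [round(e, 2) for e in estimates])
    print("pull counts:", counts)
    print("total reward:", total)
```

With these assumed settings the learner spends most of its pulls on the arm with the highest payout probability, which it had to discover by trying the arms rather than by being told.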

AI Flight with Unity ML-Agents

#artificialintelligence | Aug-26-2019, 11:55:28 GMT

A new project with planes taught to fly with reinforcement learning via Unity ML-Agents. This is still a work in progress, but the AI behavior is working amazingly.

artificial intelligence, machine learning, reinforcement learning, (4 more...)

#artificialintelligence

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.35)

Intercon World Keynote Dr. Ganapathi Pulipaka Receives a Top 50 Technology Leader Award for His Contributions to AI, Machine Learning, Mathematics, and Data Science

#artificialintelligence | Aug-26-2019, 11:53:11 GMT

At the Intercon conference, Dr. GP gave a motivational keynote speech on deep reinforcement learning and the landscape of machine learning and artificial intelligence. He noted that MIT Technology Review downloaded 16,625 publicly available research papers from the computer science and artificial intelligence section of arXiv through November 2018. Natural language processing of the abstracts tracked the words "constraint," "theory," "rule," "logic," "program," "learning," "network," "data," "task," and "performance" to trace the recent reinforcement learning boom. Dr. GP said the trends show the rise of traditional neural networks in the 1950s and 1960s, symbolic approaches in the 1970s, knowledge-based and rule-based systems in the 1980s, support vector machines in the 1990s, and the reign of neural networks in the 2010s with the heavy adoption of deep neural networks. Deep Traffic is a reinforcement learning simulation from MIT's Deep Traffic competition, in which self-driving cars drive on a multi-lane freeway using a model-free, off-policy reinforcement learning process. The competition received 24,000 entries and has inspired data scientists and machine learning enthusiasts to evaluate Deep Q-Learning network variants and hyperparameter configurations, with episodic training that adds up to 96.6 years of RL simulation and 572.2 million crowdsourced and optimized DQN hyperparameters used to train the agents.

ganapathi pulipaka, machine learning, reinforcement learning, (11 more...)

#artificialintelligence

Country: North America > United States > California > San Francisco County > San Francisco (0.16)

Genre: Press Release (0.30)

Industry:

Information Technology (1.00)
Media > News (0.50)
Education > Curriculum > Subject-Specific Education (0.41)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.91)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.55)

Urban flows prediction from spatial-temporal data using machine learning: A survey

Xie, Peng, Li, Tianrui, Liu, Jia, Du, Shengdong, Yang, Xin, Zhang, Junbo

arXiv.org Machine Learning | Aug-26-2019

Urban spatial-temporal flows prediction is of great importance to traffic management, land use, public safety, etc. Urban flows are affected by several complex and dynamic factors, such as patterns of human activities, weather, events and holidays. The datasets used to evaluate the flows come from various sources in different domains, e.g. mobile phone data, taxi trajectory data, metro/bus swiping data, bike-sharing data and so on. To summarize the methodologies of urban flows prediction, in this paper, we first introduce four main factors affecting urban flows. Second, in order to further analyze urban flows, the preparation process of multi-source spatial-temporal data related to urban flows is partitioned into three groups. Third, we choose the spatial-temporal dynamic data as a case study for the urban flows prediction task. Fourth, we analyze and compare some well-known and state-of-the-art flows prediction methods in detail, classifying them into five categories: statistics-based, traditional machine learning-based, deep learning-based, reinforcement learning-based and transfer learning-based methods. Finally, we give open challenges of urban flows prediction and an outlook on the future of this field. This paper will help researchers find suitable methods and open datasets for addressing urban spatial-temporal flows forecasting problems.

machine learning, prediction, reinforcement learning, (18 more...)

arXiv.org Machine Learning

1908.10218

Country: North America > United States (1.00)

Genre:

Research Report (1.00)
Overview (1.00)

Industry:

Transportation > Ground > Road (1.00)
Transportation > Infrastructure & Services (0.95)
Government (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
(2 more...)

AI Development and Trends in E-Commerce

#artificialintelligence | Aug-25-2019, 15:16:29 GMT

The traditional retail industry is undergoing a significant reinvention and upgrade as more and more brick-and-mortar stores boost business by adopting e-commerce platforms powered by cutting-edge tech. The recent rapid development and deployment of AI technologies such as machine learning, computer vision and reinforcement learning have enabled new e-commerce products and solutions for various scenarios and strengthened the retail value chain. Online sales take place on e-commerce platforms (e.g. Alibaba's Taobao and Tmall, Amazon, JD.com) or on a brand's own official web stores. Thanks to recent advancements in AI and digital technologies, operating costs for e-commerce have been reduced, enabling more retailers to realize e-commerce transformations. The 2018 global retail e-commerce market amounted to US$2.8 trillion and is expected to grow 75 percent to US$4.9 trillion by 2021.

machine learning, natural language, reinforcement learning, (15 more...)

#artificialintelligence

Industry:

Retail (1.00)
Information Technology > Services > e-Commerce Services (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (0.99)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.32)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.31)

Tutorial and Survey on Probabilistic Graphical Model and Variational Inference in Deep Reinforcement Learning

Sun, Xudong, Bischl, Bernd

arXiv.org Artificial Intelligence | Aug-25-2019

Probabilistic Graphical Modeling and Variational Inference play an important role in recent advances in Deep Reinforcement Learning. Aiming at a self-consistent tutorial survey, this article illustrates basic concepts of reinforcement learning with Probabilistic Graphical Models and derives some basic formulas as a recap. Recent advances in deep reinforcement learning along different research directions are reviewed and compared from various aspects. We offer Probabilistic Graphical Models together with detailed explanations and derivations for several use cases of Variational Inference, which serve as complementary material to the original contributions.

artificial intelligence, machine learning, reinforcement learning, (10 more...)

arXiv.org Artificial Intelligence

1908.09381

Country: Europe > Germany (0.14)

Genre:

Research Report (0.50)
Overview (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Curiosity

#artificialintelligence | Aug-23-2019, 08:18:30 GMT

DeepCubeA, a deep reinforcement learning algorithm, can solve a Rubik's Cube in a fraction of a second, without any specific domain knowledge or in-game coaching from humans.

curiosity, machine learning, reinforcement learning, (1 more...)

#artificialintelligence

Industry:

Leisure & Entertainment > Games > Rubik's Cube (0.40)
Information Technology > Services (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Reinforcement Learning in Healthcare: A Survey

Yu, Chao, Liu, Jiming, Nemati, Shamim

arXiv.org Artificial Intelligence | Aug-22-2019

As a subfield of machine learning, reinforcement learning (RL) aims at empowering one's capabilities in behavioural decision making by using interaction experience with the world and evaluative feedback. Unlike traditional supervised learning methods that usually rely on one-shot, exhaustive and supervised reward signals, RL tackles sequential decision making problems with sampled, evaluative and delayed feedback simultaneously. Such distinctive features make RL a suitable candidate for developing powerful solutions in a variety of healthcare domains, where diagnostic decisions or treatment regimes are usually characterized by a prolonged and sequential procedure. This survey discusses the broad applications of RL techniques in healthcare domains, in order to provide the research community with a systematic understanding of the theoretical foundations, enabling methods and techniques, existing challenges, and new insights of this emerging paradigm. After briefly examining theoretical foundations and key techniques in RL research from efficient and representational directions, we provide an overview of RL applications in a variety of healthcare domains, ranging from dynamic treatment regimes in chronic diseases and critical care, and automated medical diagnosis from both unstructured and structured clinical data, to many other control or scheduling domains that have infiltrated many aspects of a healthcare system. Finally, we summarize the challenges and open issues in current research, and point out some potential solutions and directions for future research.

nephrology, upstream oil & gas, vascular disease, (31 more...)

arXiv.org Artificial Intelligence

1908.08796

Country:

North America > United States (0.92)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.27)
North America > Canada (0.14)
(2 more...)

Genre:

Research Report > Strength High (1.00)
Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
(2 more...)

Industry:

Health & Medicine > Therapeutic Area > Psychiatry/Psychology (1.00)
Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Therapeutic Area > Neurology (1.00)
(15 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
(2 more...)

Practical Risk Measures in Reinforcement Learning

Di Castro, Dotan, Oren, Joel, Mannor, Shie

arXiv.org Machine Learning | Aug-22-2019

Practical application of Reinforcement Learning (RL) often involves risk considerations. We study a generalized approximation scheme for risk measures, based on Monte-Carlo simulations, where the risk measures need not necessarily be coherent. We demonstrate that, even in simple problems, measures such as the variance of the reward-to-go do not capture the risk in a satisfactory manner. In addition, we show how a risk measure can be derived from a model's realizations. We propose a neural architecture for estimating the risk and suggest a risk critic architecture that can be used to optimize a policy under general risk measures. We conclude our work with experiments that demonstrate the efficacy of our approach.
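
The abstract's point about variance can be illustrated with a small, hedged sketch. This is not the paper's method or architecture: the two return samples below are hand-built stand-ins for Monte-Carlo rollouts, and conditional value-at-risk (CVaR) is swapped in here only as one familiar tail-risk measure for comparison. The function name `empirical_cvar` and the 5% level are assumptions made for illustration.

```python
import statistics


def empirical_cvar(returns, alpha=0.05):
    """Mean of the worst alpha-fraction of sampled returns (empirical CVaR).

    Illustrative helper, not taken from the paper.
    """
    ordered = sorted(returns)                  # worst outcomes first
    k = max(1, round(alpha * len(ordered)))    # size of the lower tail
    return sum(ordered[:k]) / k


# Hypothetical returns standing in for 100 Monte-Carlo rollouts of two policies.
# Policy A: small gains with one rare catastrophic loss.
returns_a = [1.0] * 99 + [-99.0]
# Policy B: small losses with one rare large gain (the mirror image of A).
returns_b = [-1.0] * 99 + [99.0]

if __name__ == "__main__":
    for name, rets in (("A", returns_a), ("B", returns_b)):
        print(
            name,
            "mean =", statistics.fmean(rets),
            "variance =", statistics.pvariance(rets),
            "CVaR(5%) =", empirical_cvar(rets),
        )
```

Both samples have mean 0 and variance 99, so a variance-based measure cannot tell them apart, while the 5% CVaR is -19 for A and -1 for B. Under these assumed numbers, only the tail measure flags the policy with the catastrophic outcome.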

architecture, risk function, risk measure, (14 more...)

arXiv.org Machine Learning

1908.08379

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > Massachusetts > Middlesex County > Belmont (0.04)
North America > United States > California > San Mateo County > Menlo Park (0.04)
(2 more...)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.47)

Opponent Aware Reinforcement Learning

Gallego, Victor, Naveiro, Roi, Insua, David Rios, Oteiza, David Gomez-Ullate

arXiv.org Machine Learning | Aug-22-2019

In several reinforcement learning (RL) scenarios, such as security settings, there may be adversaries trying to interfere with the reward-generating process for their own benefit. We introduce Threatened Markov Decision Processes (TMDPs) as a framework to support an agent against potential opponents in an RL context. We also propose a level-k thinking scheme resulting in a novel learning approach to deal with TMDPs. After introducing our framework and deriving theoretical results, relevant empirical evidence is given via extensive experiments, showing the benefits of accounting for adversaries in RL while the agent learns.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

arXiv.org Machine Learning

1908.08773

Country: North America > United States (0.28)

Genre:

Research Report (0.64)
Overview (0.46)

Industry: Leisure & Entertainment > Games > Computer Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.48)
(2 more...)
