AITopics | Reinforcement Learning

Collaborating Authors

Reinforcement Learning

"Reinforcement learning is learning what to do – how to map situations to actions – so as to maximize a numerical reward signal. The learner is not told which actions to take, as in most forms of machine learning, but instead must discover which actions yield the most reward by trying them."
– Sutton, Richard S. and Andrew G. Barto. Reinforcement Learning: An Introduction. (1.1). MIT Press, Cambridge, MA, 1998.

News Overviews Instructional Materials AI-Alerts Classics

Feedback-Based Tree Search for Reinforcement Learning

Jiang, Daniel R., Ekwedike, Emmanuel, Liu, Han

arXiv.org Artificial IntelligenceMay-15-2018

Inspired by recent successes of Monte-Carlo tree search (MCTS) in a number of artificial intelligence (AI) application domains, we propose a model-based reinforcement learning (RL) technique that iteratively applies MCTS on batches of small, finite-horizon versions of the original infinite-horizon Markov decision process. The terminal condition of the finite-horizon problems, or the leaf-node evaluator of the decision tree generated by MCTS, is specified using a combination of an estimated value function and an estimated policy function. The recommendations generated by the MCTS procedure are then provided as feedback in order to refine, through classification and regression, the leaf-node evaluator for the next iteration. We provide the first sample complexity bounds for a tree search-based RL algorithm. In addition, we show that a deep neural network implementation of the technique can create a competitive AI agent for the popular multi-player online battle arena (MOBA) game King of Glory.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

arXiv.org Artificial Intelligence

1805.05935

Genre: Research Report (0.50)

Industry: Leisure & Entertainment > Games > Computer Games (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

Add feedback

Graph Signal Sampling via Reinforcement Learning

Abramenko, Oleksii, Jung, Alexander

arXiv.org Artificial IntelligenceMay-15-2018

Modern information processing systems generate massive datasets which are often strongly heterogeneous, e.g., partially labeled mixtures of different media (audio, video, text). A quite successful approach to such datasets is based on representing the data as networks or graphs. In particular, we represent datasets by graph signals defined over an underlying graph, which reflects similarities between individual data points. The graph signal values encode label information which often conforms to a clustering hypothesis, i.e., the signal values (labels) of close-by nodes (similar data points) are similar. Two core problems considered within graph signal processing (GSP) are (i) how to sample them, i.e., which signal values provide the most information about the entire dataset, and (ii) how to recover the entire graph signal from these few signal values (samples). These problems have been studied in [1]-[6] which discussed convex optimization methods for recovering a graph signal from a small number of signal values observed on the nodes belonging to a given (small) sampling set. Sufficient conditions on the sampling set and clustering structure such that these convex methods are successful have been discussed in [4], [7].

data mining, machine learning, reinforcement learning, (20 more...)

arXiv.org Artificial Intelligence

1805.05827

Genre: Research Report (0.64)

Technology:

Information Technology > Data Science > Data Mining > Big Data (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.42)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.34)

Add feedback

Leveraging human knowledge in tabular reinforcement learning: A study of human subjects

Rosenfeld, Ariel, Cohen, Moshe, Taylor, Matthew E., Kraus, Sarit

arXiv.org Artificial IntelligenceMay-15-2018

Reinforcement Learning (RL) can be extremely effective in solving complex, real-world problems. However, injecting human knowledge into an RL agent may require extensive effort and expertise on the human designer's part. To date, human factors are generally not considered in the development and evaluation of possible RL approaches. In this article, we set out to investigate how different methods for injecting human knowledge are applied, in practice, by human designers of varying levels of knowledge and skill. We perform the first empirical evaluation of several methods, including a newly proposed method named SASS which is based on the notion of similarities in the agent's state-action space. Through this human study, consisting of 51 human participants, we shed new light on the human factors that play a key role in RL. We find that the classical reward shaping technique seems to be the most natural method for most designers, both expert and non-expert, to speed up RL. However, we further find that our proposed method SASS can be effectively and efficiently combined with reward shaping, and provides a beneficial alternative to using only a single speedup method with minimal human designer effort overhead.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

arXiv.org Artificial Intelligence

1805.05769

Country:

North America > United States (1.00)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.28)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.94)
Questionnaire & Opinion Survey (0.93)
Overview (0.92)

Industry:

Leisure & Entertainment > Sports > Soccer (1.00)
Leisure & Entertainment > Games > Computer Games (0.67)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Data Science: Supervised Machine Learning in Python

@machinelearnbotMay-14-2018, 21:23:26 GMT

In recent years, we've seen a resurgence in AI, or artificial intelligence, and machine learning. Machine learning has led to some amazing results, like being able to analyze medical images and predict diseases on-par with human experts. Google's AlphaGo program was able to beat a world champion in the strategy game go using deep reinforcement learning. Machine learning is even being used to program self driving cars, which is going to change the automotive industry forever. Imagine a world with drastically reduced car accidents, simply by removing the element of human error.

artificial intelligence, machine learning, reinforcement learning, (6 more...)

@machinelearnbot

Genre: Instructional Material > Course Syllabus & Notes (0.68)

Industry:

Automobiles & Trucks (0.78)
Information Technology (0.72)
Leisure & Entertainment > Games (0.56)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.56)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.56)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.54)
(3 more...)

Add feedback

Data Science in 90 Seconds: Reinforcement Learning - DATAVERSITY

#artificialintelligenceMay-14-2018, 14:01:49 GMT

Laura was born in a small town in North Carolina. She went on to earn a B.S. in Textile Engineering and a B.A. in Spanish at North Carolina State University. Laura thought this unique combination of majors would be amazing after attending a summer camp in high school where she played with bouncing polymers. While attending North Carolina State University, she earned a scholarship to study a summer term in Peru, where she fell in love with the Spanish language. Upon graduation, she moved to Washington, D.C. where she served in a variety of digital information roles.

artificial intelligence, machine learning, reinforcement learning, (3 more...)

#artificialintelligence

Country:

North America > United States > North Carolina (0.82)
North America > United States > District of Columbia > Washington (0.30)

Genre: Personal (0.65)

Industry: Education (0.65)

Technology:

Information Technology > Data Science (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.40)

Add feedback

Deep Reinforcement Learning Essential Prerequisite Review

#artificialintelligenceMay-14-2018, 10:36:43 GMT

In this section we are going to review all the background knowledge you need to have in order to understand Deep Reinforcement Learning. This includes: ** Markov Decision Processes (MDPs) ** Dynamic Programming ** Monte Carlo ** Temporal difference learning ** Deep Learning ** Approximation Methods ** State Transition Probabilities Hope to enjoy it!

deep learning, machine learning, reinforcement learning essential prerequisite review, (1 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.45)

Add feedback

Advances in Experience Replay

Wan, Tracy, Xu, Neil

arXiv.org Machine LearningMay-14-2018

This project combines recent advances in experience replay techniques, namely, Combined Experience Replay (CER), Prioritized Experience Replay (PER), and Hindsight Experience Replay (HER). We show the results of combinations of these techniques with DDPG and DQN methods. CER always adds the most recent experience to the batch. PER chooses which experiences should be replayed based on how beneficial they will be towards learning. HER learns from failure by substituting the desired goal with the achieved goal and recomputing the reward function. The effectiveness of combinations of these experience replay techniques is tested in a variety of OpenAI gym environments.

artificial intelligence, machine learning, reinforcement learning, (14 more...)

arXiv.org Machine Learning

1805.05536

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.35)

Add feedback

Curiosity-driven Exploration for Mapless Navigation with Deep Reinforcement Learning

Zhelo, Oleksii, Zhang, Jingwei, Tai, Lei, Liu, Ming, Burgard, Wolfram

arXiv.org Artificial IntelligenceMay-14-2018

Deep Reinforcement Learning (DRL), deploying deep neural networks as function approximators for highdimensional RL tasks, achieves state of the art performance in various fields of research [1]. DRL algorithms have been studied under the context of learning navigation policies for mobile robots. Traditional navigation solutions in robotics generally require a system of procedures, such as Simultaneous Localization and Mapping (SLAM) [2], localization and path planning in a given map, etc. With the powerful representation learning capabilities of deep networks, DRL methods bring about the possibility of learning control policies directly from raw sensory inputs, bypassing all the intermediate steps. Eliminating the requirement for localization, mapping, or path planning procedures, several DRL works have been presented that learn successful navigation policies directly from raw sensor inputs: target-driven navigation [3], successor feature RL for transferring navigation policies [4], and using auxiliary tasks to boost DRL training [5]. Many followup works have also been proposed, such as embedding SLAMlike structures into DRL networks [6], or utilizing DRL for multi-robot collision avoidance [7]. In this paper, we focus specifically on mapless navigation, where the agent is expected to navigate to a designated goal location without the knowledge of the map of its current environment.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

arXiv.org Artificial Intelligence

1804.00456

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.36)

Add feedback

A Study of AI Population Dynamics with Million-agent Reinforcement Learning

Yang, Yaodong, Yu, Lantao, Bai, Yiwei, Wang, Jun, Zhang, Weinan, Wen, Ying, Yu, Yong

arXiv.org Artificial IntelligenceMay-14-2018

We conduct an empirical study on discovering the ordered collective dynamics obtained by a population of intelligence agents, driven by million-agent reinforcement learning. Our intention is to put intelligent agents into a simulated natural context and verify if the principles developed in the real world could also be used in understanding an artificially-created intelligent population. To achieve this, we simulate a large-scale predator-prey world, where the laws of the world are designed by only the findings or logical equivalence that have been discovered in nature. We endow the agents with the intelligence based on deep reinforcement learning (DRL). In order to scale the population size up to millions agents, a large-scale DRL training platform with redesigned experience buffer is proposed. Our results show that the population dynamics of AI agents, driven only by each agent's individual self-interest, reveals an ordered pattern that is similar to the Lotka-Volterra model studied in population biology. We further discover the emergent behaviors of collective adaptations in studying how the agents' grouping behaviors will change with the environmental resources. Both of the two findings could be explained by the self-organization theory in nature.

artificial intelligence, machine learning, reinforcement learning, (13 more...)

arXiv.org Artificial Intelligence

1709.04511

Country: Europe > Sweden (0.16)

Genre: Research Report > New Finding (0.68)

Industry: Leisure & Entertainment > Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.34)

Add feedback

Deep Reinforcement Learning in Python - Introduction

#artificialintelligenceMay-13-2018, 00:01:00 GMT

Requirements: • Know reinforcement learning basics, MDPs, Dynamic Programming, Monte Carlo, TD Learning • Calculus and probability at the undergraduate level • Experience building machine learning models in Python and Numpy • Know how to build a feedforward, convolutional, and recurrent neural network using Theano and Tensorflow This course is all about the application of deep learning and neural networks to reinforcement learning. If you've taken my first reinforcement learning class, then you know that reinforcement learning is on the bleeding edge of what we can do with AI. Specifically, the combination of deep learning with reinforcement learning has led to AlphaGo beating a world champion in the strategy game Go, it has led to self-driving cars, and it has led to machines that can play video games at a superhuman level. Reinforcement learning has been around since the 70s but none of this has been possible until now. The world is changing at a very fast pace.

machine learning, reinforcement, reinforcement learning, (7 more...)

#artificialintelligence

Country: North America > United States (0.20)

Industry:

Leisure & Entertainment > Games (0.96)
Information Technology (0.63)
Education (0.60)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback