AITopics | Reinforcement Learning

Collaborating Authors

Reinforcement Learning

"Reinforcement learning is learning what to do – how to map situations to actions – so as to maximize a numerical reward signal. The learner is not told which actions to take, as in most forms of machine learning, but instead must discover which actions yield the most reward by trying them."
– Sutton, Richard S. and Andrew G. Barto. Reinforcement Learning: An Introduction. (1.1). MIT Press, Cambridge, MA, 1998.

News Overviews Instructional Materials AI-Alerts Classics

A hedge fund fully managed by artificial intelligence, with Brandeis roots BrandeisNOW

#artificialintelligenceJun-23-2018, 16:01:13 GMT

A.I. Capital Management, a Brandeis University startup seeking to build one of the world's first hedge funds fully managed by artificial intelligence, has been invited to participate in the 2018 MassChallenge Boston accelerator program. A fintech startup creating artificial intelligence trading systems for foreign exchange markets, A.I. Capital Management uses Deep Reinforcement Learning (RL) method, an algorithmic framework for programming machine behavior. A.I. Capital Management hopes to reinvent the money managing business by building a hedge fund fully managed by artificial intelligence, eliminating human error and emotion. By participating in the 2018 MassChallenge, A.I. Capital Management also gains access to top corporate partners, expert mentorship, a tailored curriculum, scholarship opportunities and more than 26,000 square-feet of co-working space in the Innovation and Design Building all at zero cost and for zero equity. At the culmination of the four-month program, A.I. Capital Management will also have a chance to compete for shares of $1.5 million in cash prizes at the MassChallenge Awards on Oct. 17.

artificial intelligence, machine learning, reinforcement learning, (5 more...)

#artificialintelligence

Industry: Banking & Finance > Trading (1.00)

Technology:

Information Technology > e-Commerce > Financial Technology (0.66)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.59)

Add feedback

Stroke-based Character Recognition with Deep Reinforcement Learning

Huang, Zhewei, Heng, Wen, Tao, Yuanzheng, Zhou, Shuchang

arXiv.org Machine LearningJun-23-2018

The stroke sequence of characters is significant for the character recognition task. In this paper, we propose a stroke-based character recognition (SCR) method. We train a stroke inference module under deep reinforcement learning (DRL) framework. This module extracts the sequence of strokes from characters, which can be integrated with character recognizers to improve their robustness to noise. Our experiments show that the module can handle complicated noise and reconstruct the characters. Meanwhile, it can also help achieve great ability in defending adversarial attacks of character recognizers.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

arXiv.org Machine Learning

1806.0899

Genre: Research Report (0.50)

Industry:

Leisure & Entertainment > Games > Computer Games (0.93)
Information Technology (0.90)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)

Add feedback

Policy Gradients playing Doom deathmatch with Tensorflow (tutorial)

#artificialintelligenceJun-22-2018, 05:41:16 GMT

If you're new in Reinforcement Learning, please read first my article "An introduction to Reinforcement Learning": https://medium.freecodecamp.org/an-in... If you have some feedbacks and advice please comment below. Moreover if you have some questions you can ask me in the comments.

artificial intelligence, machine learning, reinforcement learning, (5 more...)

#artificialintelligence

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

What Is YOUR AI Goal? – Udacity Inc – Medium

#artificialintelligenceJun-22-2018, 00:30:59 GMT

Udacity's School of Artificial Intelligence has officially opened our new Deep Reinforcement Learning Nanodegree program for enrollment, and in doing so, we have completed a whirlwind effort that began at Intersect back in March of this year, when our School of AI was officially unveiled to the world: Today, anyone interested in entering the incredible world of Artificial Intelligence has the opportunity to do so, through the learning portal that is our School of AI. Upon arrival to the school's home page, you are prompted by a simple question: It's actually not that simple a question, of course, but we strive to make it so by offering you clear paths to pursue, depending on your current skills and experience, and your ultimate objectives. Whether you're new to the field, or already a working professional, we offer you a point-of-entry. Whether you want to work at a company focused on AI, or bring new AI techniques to a company that can benefit from them, we offer tailored curriculum to support your journey. Perhaps you're simply a future-minded thinker who sees where the world is headed, and you want to start planning ahead by adding valuable skills to your toolkit now.

artificial intelligence, deep reinforcement learning nanodegree program, machine learning, (4 more...)

#artificialintelligence

Country: North America > United States > California (0.06)

Industry:

Education > Educational Setting > Online (0.97)
Education > Educational Technology > Educational Software > Computer Based Training (0.66)

Technology:

Information Technology > Enterprise Applications > Human Resources > Learning Management (0.66)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.61)

Add feedback

Deep Reinforcement Learning: An Overview

Mousavi, Seyed Sajad, Schukat, Michael, Howley, Enda

arXiv.org Artificial IntelligenceJun-22-2018

In recent years, a specific machine learning method called deep learning has gained huge attraction, as it has obtained astonishing results in broad applications such as pattern recognition, speech recognition, computer vision, and natural language processing. Recent research has also been shown that deep learning techniques can be combined with reinforcement learning methods to learn useful representations for the problems with high dimensional raw data input. This chapter reviews the recent advances in deep reinforcement learning with a focus on the most used deep architectures such as autoencoders, convolutional neural networks and recurrent neural networks which have successfully been come together with the reinforcement learning framework.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

arXiv.org Artificial Intelligence

doi: 10.1007/978-3-319-56991-8_32

1806.08894

Country: Europe > Ireland (0.04)

Genre: Research Report (1.00)

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.69)

Add feedback

Many-Goals Reinforcement Learning

Veeriah, Vivek, Oh, Junhyuk, Singh, Satinder

arXiv.org Artificial IntelligenceJun-22-2018

All-goals updating exploits the off-policy nature of Q-learning to update all possible goals an agent could have from each transition in the world, and was introduced into Reinforcement Learning (RL) by Kaelbling (1993). In prior work this was mostly explored in small-state RL problems that allowed tabular representations and where all possible goals could be explicitly enumerated and learned separately. In this paper we empirically explore 3 different extensions of the idea of updating many (instead of all) goals in the context of RL with deep neural networks (or DeepRL for short). First, in a direct adaptation of Kaelbling's approach we explore if many-goals updating can be used to achieve mastery in non-tabular visual-observation domains. Second, we explore whether many-goals updating can be used to pre-train a network to subsequently learn faster and better on a single main task of interest. Third, we explore whether many-goals updating can be used to provide auxiliary task updates in training a network to learn faster and better on a single main task of interest. We provide comparisons to baselines for each of the 3 extensions.

artificial intelligence, machine learning, reinforcement learning, (19 more...)

arXiv.org Artificial Intelligence

1806.09605

Country:

North America > United States > Michigan > Washtenaw County > Ann Arbor (0.14)
North America > United States > Texas > Travis County > Austin (0.14)

Genre: Research Report (1.00)

Industry: Leisure & Entertainment > Games > Computer Games (0.33)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

Add feedback

Meta-Learning by the Baldwin Effect

Fernando, Chrisantha Thomas, Sygnowski, Jakub, Osindero, Simon, Wang, Jane, Schaul, Tom, Teplyashin, Denis, Sprechmann, Pablo, Pritzel, Alexander, Rusu, Andrei A.

arXiv.org Artificial IntelligenceJun-22-2018

The scope of the Baldwin effect was recently called into question by two papers that closely examined the seminal work of Hinton and Nowlan. To this date there has been no demonstration of its necessity in empirically challenging tasks. Here we show that the Baldwin effect is capable of evolving few-shot supervised and reinforcement learning mechanisms, by shaping the hyperparameters and the initial parameters of deep learning algorithms. Furthermore it can genetically accommodate strong learning biases on the same set of problems as a recent machine learning algorithm called MAML "Model Agnostic Meta-Learning" which uses second-order gradients instead of evolution to learn a set of reference parameters (initial weights) that can allow rapid adaptation to tasks sampled from a distribution. Whilst in simple cases MAML is more data efficient than the Baldwin effect, the Baldwin effect is more general in that it does not require gradients to be backpropagated to the reference parameters or hyperparameters, and permits effectively any number of gradient updates in the inner loop. The Baldwin effect learns strong learning dependent biases, rather than purely genetically accommodating fixed behaviours in a learning independent manner.

evolutionary algorithm, machine learning, reinforcement learning, (17 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3205651.3205763

1806.07917

Genre: Research Report (0.40)

Industry: Health & Medicine (0.69)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.90)

Add feedback

r/MachineLearning - [P] Policy Gradients with Doom and Tensorflow (tutorial)

#artificialintelligenceJun-21-2018, 15:41:56 GMT

Again let me say what you think about the course (articles and videos) and how it should be improved!

artificial intelligence, machine learning, reinforcement learning, (9 more...)

#artificialintelligence

Genre: Instructional Material > Course Syllabus & Notes (0.70)

Industry:

Education (0.44)
Media > News (0.40)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.75)

Add feedback

Optimal Path Detection With Reinforcement Learning - DZone AI

#artificialintelligenceJun-21-2018, 09:01:47 GMT

In this article, I will design an agent that finds the optimum path through a given map using Reinforcement Learning. I hope it becomes a useful article in the sense of awareness. Reinforcement Learning (RL) is a machine learning technique that deals with the problems of finding the optimum actions that must be done in a given situation in order to maximize rewards. This learning technique, which is inspired by behavioral psychology, is usually described as follows. An agent in any environment makes certain movements in this environment and gains rewards as a result of these movements.

artificial intelligence, machine learning, reinforcement learning, (4 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Microsoft acquires AI startup to fuel artificial intelligence capabilities

#artificialintelligenceJun-21-2018, 09:01:10 GMT

SAN FRANCISCO: Microsoft announced on Wednesday that it has signed an agreement to acquire Bonsai, an artificial intelligence (AI) startup based in San Francisco, to boost its AI and machine learning capabilities. Microsoft said its acquisition of the small startup is "another major step forward in our vision to make it easier for developers and subject matter experts to build the "brains -- machine learning model for autonomous systems of all kinds." In its official blog, Microsoft said Bonsai has developed technology that will let experts with AI experience to work with autonomous systems, reports Xinhua news agency. "The company is building a general-purpose, deep reinforcement learning platform especially suited for enterprises leveraging industrial control systems such as robotics, energy, HVAC, manufacturing and autonomous systems in general," said the tech giant. Bonsai's platform combined with rich simulation tools and reinforcement learning work in Microsoft Research will compose with its Azure Machine Learning running on the Azure Cloud with GPUs and Brainwave, it added.

artificial intelligence, machine learning, reinforcement learning, (8 more...)

#artificialintelligence

Country:

North America > United States > California > San Francisco County > San Francisco (0.52)
North America > United States > California > Alameda County > Berkeley (0.08)

Industry: Construction & Engineering (0.85)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.92)

Add feedback