Goto

Collaborating Authors

 Reinforcement Learning


A hedge fund fully managed by artificial intelligence, with Brandeis roots BrandeisNOW

#artificialintelligence

A.I. Capital Management, a Brandeis University startup seeking to build one of the world's first hedge funds fully managed by artificial intelligence, has been invited to participate in the 2018 MassChallenge Boston accelerator program. A fintech startup creating artificial intelligence trading systems for foreign exchange markets, A.I. Capital Management uses Deep Reinforcement Learning (RL) method, an algorithmic framework for programming machine behavior. A.I. Capital Management hopes to reinvent the money managing business by building a hedge fund fully managed by artificial intelligence, eliminating human error and emotion. By participating in the 2018 MassChallenge, A.I. Capital Management also gains access to top corporate partners, expert mentorship, a tailored curriculum, scholarship opportunities and more than 26,000 square-feet of co-working space in the Innovation and Design Building all at zero cost and for zero equity. At the culmination of the four-month program, A.I. Capital Management will also have a chance to compete for shares of $1.5 million in cash prizes at the MassChallenge Awards on Oct. 17.


Stroke-based Character Recognition with Deep Reinforcement Learning

arXiv.org Machine Learning

The stroke sequence of characters is significant for the character recognition task. In this paper, we propose a stroke-based character recognition (SCR) method. We train a stroke inference module under deep reinforcement learning (DRL) framework. This module extracts the sequence of strokes from characters, which can be integrated with character recognizers to improve their robustness to noise. Our experiments show that the module can handle complicated noise and reconstruct the characters. Meanwhile, it can also help achieve great ability in defending adversarial attacks of character recognizers.


Policy Gradients playing Doom deathmatch with Tensorflow (tutorial)

#artificialintelligence

If you're new in Reinforcement Learning, please read first my article "An introduction to Reinforcement Learning": https://medium.freecodecamp.org/an-in... If you have some feedbacks and advice please comment below. Moreover if you have some questions you can ask me in the comments.


What Is YOUR AI Goal? โ€“ Udacity Inc โ€“ Medium

#artificialintelligence

Udacity's School of Artificial Intelligence has officially opened our new Deep Reinforcement Learning Nanodegree program for enrollment, and in doing so, we have completed a whirlwind effort that began at Intersect back in March of this year, when our School of AI was officially unveiled to the world: Today, anyone interested in entering the incredible world of Artificial Intelligence has the opportunity to do so, through the learning portal that is our School of AI. Upon arrival to the school's home page, you are prompted by a simple question: It's actually not that simple a question, of course, but we strive to make it so by offering you clear paths to pursue, depending on your current skills and experience, and your ultimate objectives. Whether you're new to the field, or already a working professional, we offer you a point-of-entry. Whether you want to work at a company focused on AI, or bring new AI techniques to a company that can benefit from them, we offer tailored curriculum to support your journey. Perhaps you're simply a future-minded thinker who sees where the world is headed, and you want to start planning ahead by adding valuable skills to your toolkit now.


Deep Reinforcement Learning: An Overview

arXiv.org Artificial Intelligence

In recent years, a specific machine learning method called deep learning has gained huge attraction, as it has obtained astonishing results in broad applications such as pattern recognition, speech recognition, computer vision, and natural language processing. Recent research has also been shown that deep learning techniques can be combined with reinforcement learning methods to learn useful representations for the problems with high dimensional raw data input. This chapter reviews the recent advances in deep reinforcement learning with a focus on the most used deep architectures such as autoencoders, convolutional neural networks and recurrent neural networks which have successfully been come together with the reinforcement learning framework.


Many-Goals Reinforcement Learning

arXiv.org Artificial Intelligence

All-goals updating exploits the off-policy nature of Q-learning to update all possible goals an agent could have from each transition in the world, and was introduced into Reinforcement Learning (RL) by Kaelbling (1993). In prior work this was mostly explored in small-state RL problems that allowed tabular representations and where all possible goals could be explicitly enumerated and learned separately. In this paper we empirically explore 3 different extensions of the idea of updating many (instead of all) goals in the context of RL with deep neural networks (or DeepRL for short). First, in a direct adaptation of Kaelbling's approach we explore if many-goals updating can be used to achieve mastery in non-tabular visual-observation domains. Second, we explore whether many-goals updating can be used to pre-train a network to subsequently learn faster and better on a single main task of interest. Third, we explore whether many-goals updating can be used to provide auxiliary task updates in training a network to learn faster and better on a single main task of interest. We provide comparisons to baselines for each of the 3 extensions.


Meta-Learning by the Baldwin Effect

arXiv.org Artificial Intelligence

The scope of the Baldwin effect was recently called into question by two papers that closely examined the seminal work of Hinton and Nowlan. To this date there has been no demonstration of its necessity in empirically challenging tasks. Here we show that the Baldwin effect is capable of evolving few-shot supervised and reinforcement learning mechanisms, by shaping the hyperparameters and the initial parameters of deep learning algorithms. Furthermore it can genetically accommodate strong learning biases on the same set of problems as a recent machine learning algorithm called MAML "Model Agnostic Meta-Learning" which uses second-order gradients instead of evolution to learn a set of reference parameters (initial weights) that can allow rapid adaptation to tasks sampled from a distribution. Whilst in simple cases MAML is more data efficient than the Baldwin effect, the Baldwin effect is more general in that it does not require gradients to be backpropagated to the reference parameters or hyperparameters, and permits effectively any number of gradient updates in the inner loop. The Baldwin effect learns strong learning dependent biases, rather than purely genetically accommodating fixed behaviours in a learning independent manner.



Optimal Path Detection With Reinforcement Learning - DZone AI

#artificialintelligence

In this article, I will design an agent that finds the optimum path through a given map using Reinforcement Learning. I hope it becomes a useful article in the sense of awareness. Reinforcement Learning (RL) is a machine learning technique that deals with the problems of finding the optimum actions that must be done in a given situation in order to maximize rewards. This learning technique, which is inspired by behavioral psychology, is usually described as follows. An agent in any environment makes certain movements in this environment and gains rewards as a result of these movements.


Microsoft acquires AI startup to fuel artificial intelligence capabilities

#artificialintelligence

SAN FRANCISCO: Microsoft announced on Wednesday that it has signed an agreement to acquire Bonsai, an artificial intelligence (AI) startup based in San Francisco, to boost its AI and machine learning capabilities. Microsoft said its acquisition of the small startup is "another major step forward in our vision to make it easier for developers and subject matter experts to build the "brains -- machine learning model for autonomous systems of all kinds." In its official blog, Microsoft said Bonsai has developed technology that will let experts with AI experience to work with autonomous systems, reports Xinhua news agency. "The company is building a general-purpose, deep reinforcement learning platform especially suited for enterprises leveraging industrial control systems such as robotics, energy, HVAC, manufacturing and autonomous systems in general," said the tech giant. Bonsai's platform combined with rich simulation tools and reinforcement learning work in Microsoft Research will compose with its Azure Machine Learning running on the Azure Cloud with GPUs and Brainwave, it added.