AITopics | Reinforcement Learning

Collaborating Authors

Reinforcement Learning

"Reinforcement learning is learning what to do – how to map situations to actions – so as to maximize a numerical reward signal. The learner is not told which actions to take, as in most forms of machine learning, but instead must discover which actions yield the most reward by trying them."
– Sutton, Richard S. and Andrew G. Barto. Reinforcement Learning: An Introduction. (1.1). MIT Press, Cambridge, MA, 1998.

News Overviews Instructional Materials AI-Alerts Classics

Elon Musk Opens 1 Billion "Robot Gym" for Artificial Intelligence Training

#artificialintelligenceMay-9-2016, 22:55:43 GMT

The space and renewable energy tycoon has launched yet another pioneer non-profit project which can change the world of robotics today. Elon Musk developed a 1 billion-program "Robot Gym" to train artificial intelligence. He says he wanted the robots to be as versatile as humans when it comes to thinking. Our reinforcement learning toolkit, OpenAI Gym, is now in public beta: https://t.co/8fMRLjSJQb. Tesla's CEO, together with Sam Altman, created the artificial intelligence non-profit research company.

large language model, machine learning, reinforcement learning, (10 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.42)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.40)
(2 more...)

Add feedback

Semi-Unsupervised Clustering Using Reinforcement Learning

Bose, Sourabh (The University of Texas at Arlington) | Huber, Manfred (The University of Texas at Arlington)

AAAI ConferencesMay-8-2016

Clusters defined over a dataset by unsupervised clustering often present groupings which differ from the expected solution. This is primarily the case when some scarce knowledge of the problem exists beforehand that partially identifies desired characteristics of clusters. However conventional clustering algorithms are not defined to expect any supervision from the external world, as they are supposed to be completely unsupervised. As a result they can not benefit or effectively take into account available information about the use or properties of the clusters. In this paper we propose a reinforcement learning approach to address this problem where existing, unmodified unsupervised clustering algorithms are augmented in a way that the available sparse information is utilized to achieve more appropriate clusters. Our model works with any clustering algorithm, but the input to the algorithm, instead of being the original dataset, is a scaled version of the same, where the scaling factors are determined by the reinforcement learning algorithm.

reinforcement learning, semi-unsupervised clustering

AAAI Conferences

The Twenty-Ninth International Flairs Conference

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Reward from Demonstration in Interactive Reinforcement Learning

Raza, Syed Ali (University of Technology, Sydney) | Johnston, Benjamin (University of Technology, Sydney) | Williams, Mary-Anne (University of Technology, Sydney)

AAAI ConferencesMay-8-2016

In reinforcement learning (RL), reward shaping is used to show the desirable behavior by assigning positive or negative reward for learner’s preceding action. However, for reward shaping through human-generated rewards, an important aspect is to make it approachable to humans. Typically, a human teacher’s role requires being watchful of agent’s action to assign judgmental feedback based on prior knowledge. It can be a mentally tough and unpleasant exercise especially for lengthy teaching sessions. We present a method, Shaping from Interactive Demonstrations (SfID), which instead of judgmental reward takes action label from human. Therefore, it simplifies the teacher’s role to demonstrating the action to select from a state. We compare SfID with a standard reward shaping approach on Sokoban domain. The results show the competitiveness of SfID with the standard reward shaping.

artificial intelligence, interactive reinforcement learning, machine learning, (1 more...)

AAAI Conferences

The Twenty-Ninth International Flairs Conference

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.60)

Add feedback

The Hitchhiker Guide to A.I.

#artificialintelligenceMay-6-2016, 20:45:20 GMT

'If there is Artificial Intelligence then we won't need to think' said my recently-turned-ten daughter, on our way driving her to a piano lesson. She left me speechless while I was trying, in vain, to remove from my mind a sticky image from the movie'The Hitchhiker guide to the Galaxy'. A supercomputer called'Deep Thought' which, after 7,5 million years of calculations rendered the answer to the Ultimate Question of Life, the Universe and Everything, or, namely, the number 42. Quirky cult movies aside, the impact of artificial intelligence is immense, and the implications phenomenal. Artificial Intelligence (A.I.) is on the hype again. There are dystopian, terminator-like views backed up with warnings from renowned scientists, Stephen Hawking to start with. Even Tesla and SpaceX founder Elon Musk appeared on CNBC's'Closing Bell' cautioning about the potential risks of this technology.

data mining, machine learning, reinforcement learning, (17 more...)

#artificialintelligence

Industry:

Leisure & Entertainment (1.00)
Banking & Finance > Trading (1.00)
Information Technology (0.98)

Technology:

Information Technology > Artificial Intelligence > Natural Language (0.97)
Information Technology > Data Science > Data Mining > Big Data (0.32)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.30)

Add feedback

Open AI Gym: What do you think about the Reinforcement Learning toolkit and its evaluation methods? • /r/MachineLearning

@machinelearnbotMay-6-2016, 11:50:51 GMT

Open AI Gym: What do you think about the Reinforcement Learning toolkit and its evaluation methods? Its been approximately 10 days since the release of Gym. How has your experience with it been so far? Do you think academic/non-academic researchers will embrace it and use it as benchmarks in their publications? Or will it just end up as some highly decorated playground over time? More importantly, do you think it will do anything to accelerate AI research as claimed??

artificial intelligence, machine learning, reinforcement learning toolkit, (3 more...)

@machinelearnbot

Genre: Personal > Interview (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.70)

Add feedback

Sutton's Reinforcement Learning • /r/MachineLearning

@machinelearnbotMay-6-2016, 08:20:11 GMT

The theory for RL hasn't changed - people were using NNs to train RL models years ago. Deep Learning is just a different kind of NN, the underlying theory for RL still applies.

deep learning, machinelearning, reinforcement learning, (3 more...)

@machinelearnbot

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.44)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.40)

Add feedback

Elon Musk opens virtual gym to train your robots

#artificialintelligenceMay-2-2016, 07:31:09 GMT

High-tech entrepreneur Elon Musk has launched an open-source training "gym" for artificial-intelligence programmers. It's an interesting move for a man who in 2014 said artificial intelligence, or A.I., will pose a threat to the human race. "I think we should be very careful about artificial intelligence," Musk said about a year and a half ago during an MIT symposium. "If I were to guess at what our biggest existential threat is, it's probably that... with artificial intelligence, we are summoning the demon. In all those stories with the guy with the pentagram and the holy water, and he's sure he can control the demon. Today, Musk is moving to help programmers use A.I. and machine learning to build smart robots and smart devices. "We're releasing the public beta of OpenAI Gym, a toolkit for developing and comparing reinforcement learning (RL) algorithms," wrote Greg Brockman, OpenAI's CTO, and John Schulman, a scientist working with OpenAI, in a blog post . "We originally built OpenAI Gym as a tool to accelerate our own RL research.

large language model, machine learning, reinforcement learning, (13 more...)

#artificialintelligence

Industry:

Transportation (0.38)
Automobiles & Trucks (0.32)
Aerospace & Defense (0.32)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.98)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.98)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.98)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.61)

Add feedback

Train Your Reinforcement Learning Agents at the OpenAI Gym

#artificialintelligenceMay-2-2016, 04:00:29 GMT

Today OpenAI, a non-profit artificial intelligence research company, launched OpenAI Gym, a toolkit for developing and comparing reinforcement learning algorithms. It supports teaching agents everything from walking to playing games like Pong or Go. OpenAI researcher John Schulman shared some details about his organization, and how OpenAI Gym will make it easier for AI researchers to design, iterate and improve their next generation applications. John studied physics at Caltech, and went to UC Berkeley for graduate school. There, after a brief stint in neuroscience, he studied machine learning and robotics under Pieter Abbeel, eventually honing in on reinforcement learning as his primary topic of interest.

large language model, machine learning, reinforcement learning, (16 more...)

#artificialintelligence

Country: North America > United States > California > Alameda County > Berkeley (0.05)

Industry: Education > Educational Setting > Higher Education (0.55)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)

Add feedback

[1604.07097] Neurohex: A Deep Q-learning Hex Agent • /r/MachineLearning

@machinelearnbotMay-2-2016, 00:35:04 GMT

I posted about this earlier but mistakenly clicked text instead of link and didn't link the article so here it is. We applied Deep Q-learning with a supervised initialization to the board-game hex and obtained promising results. This was somewhat surprising to me as I suspected statistical bias involved in Q-learning may be a major issue with the large space of actions involved in a stone placement game. Code can be found here.

artificial intelligence, machine learning, machinelearning, (3 more...)

@machinelearnbot

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback