AITopics | Reinforcement Learning

Collaborating Authors

Reinforcement Learning

"Reinforcement learning is learning what to do – how to map situations to actions – so as to maximize a numerical reward signal. The learner is not told which actions to take, as in most forms of machine learning, but instead must discover which actions yield the most reward by trying them."
– Sutton, Richard S. and Andrew G. Barto. Reinforcement Learning: An Introduction. (1.1). MIT Press, Cambridge, MA, 1998.

News Overviews Instructional Materials AI-Alerts Classics

Stochastic Learning Networks and their Electronic Implementation

Alspector, Joshua, Allen, Robert B., Hu, Victor, Satyanarayana, Srinagesh

Neural Information Processing SystemsDec-31-1988

This paper focuses on the issue of learning in these networks especially with regard to their implementation in an electronic system. Learning phenomena that have been studied include associative memoryllJ.

neuron, procedure, synapse, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.05)
North America > United States > New York (0.04)
North America > United States > District of Columbia > Washington (0.04)
North America > United States > California (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.84)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Add feedback

Stochastic Learning Networks and their Electronic Implementation

Alspector, Joshua, Allen, Robert B., Hu, Victor, Satyanarayana, Srinagesh

Neural Information Processing SystemsDec-31-1988

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Country: North America > United States (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.84)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Add feedback

Learning to predict by the methods of temporal difference

Sutton, Richard S.

ClassicsFeb-1-1988

This article introduces a class of incremental learning procedures specializedfor prediction that is, for using past experience with an incompletely knownsystem to predict its future behavior. Whereas conventional prediction-learningmethods assign credit by means of the difference between predicted and actual outcomes,tile new methods assign credit by means of the difference between temporallysuccessive predictions. Although such temporal-difference method~ have been used inSamuel's checker player, Holland's bucket brigade, and the author's Adaptive HeuristicCritic, they have remained poorly understood. Here we prove their convergenceand optimality for special cases and relate them to supervised-learning methods. Formost real-world prediction problems, telnporal-differenee methods require less memoryand less peak computation than conventional methods and they produce moreaccurate predictions. We argue that most problems to which supervised learningis currently applied are really prediction problemsMachine Learning 3: 9-44, erratum p. 377

machine learning, prediction, reinforcement learning, (20 more...)

Classics

Country:

North America > United States > California > Orange County > Irvine (0.14)
North America > United States > New York (0.04)
North America > United States > California > Santa Clara County > Los Altos (0.04)
(10 more...)

Genre: Workflow (0.67)

Industry: Leisure & Entertainment > Games (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Learning to predict by the methods of temporal differences

Sutton, R. S.

ClassicsFeb-1-1988

Machine Learning, 3, 9–44.

learning, temporal difference

Classics

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.40)

Add feedback

Associative search network: A reinforcement learning associative memory

Barto, A. G. | Sutton, R. S. | Brouwer, P. S.

ClassicsFeb-1-1981

An associative memory system is presented which does not require a "teacher" to provide the desired associations. For each input key it conducts a search for the output pattern which optimizes an external payoff or reinforcement signal. The associative search network (ASN) combines pattern recognition and function optimization capabilities in a simple and effective way. We define the associative search problem, discuss conditions under which the associative search network is capable of solving it, and present results from computer simulations. The synthesis of sensory-motor control surfaces is discussed as an example of the associative search problem.

artificial intelligence, machine learning, reinforcement learning, (4 more...)

Classics

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.80)
Information Technology > Artificial Intelligence > Systems & Languages > Programming Languages (0.70)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.70)

Add feedback