Dopamine Bonuses

Dec-31-2001–Neural Information Processing Systems

Substantial data support a temporal difference (TO) model of dopamine (OA) neuron activity in which the cells provide a global error signal for reinforcement learning. However, in certain circumstances, OAactivity seems anomalous under the TO model, responding to non-rewarding stimuli. We address these anomalies bysuggesting that OA cells multiplex information about reward bonuses,including Sutton's exploration bonuses and Ng et al's non-distorting shaping bonuses. We interpret this additional role for OA in terms of the unconditional attentional and psychomotor effectsof dopamine, having the computational role of guiding exploration. 1 Introduction Much evidence suggests that dopamine cells in the primate midbrain play an important rolein reward and action learning. Electrophysiological studies support a theory that OA cells signal a global prediction error for summed future reward in appetitive conditioning tasks (Montague et al, 1996; Schultz et al, 1997), in the form of a temporal difference prediction error term.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Dec-31-2001

Conferences PDF

Add feedback

Country:
- Europe > United Kingdom (0.28)

Industry:
- Health & Medicine > Therapeutic Area > Neurology (0.95)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Duplicate Docs Excel Report

Title
Dopamine Bonuses
Dopamine Bonuses

Similar Docs Excel Report more

Title	Similarity	Source
None found