Navigating through Temporal Difference
Neural Information Processing Systems
Barto, Sutton and Watkins [2] introduced a grid task as a didactic example of temporal difference planning and asynchronous dynamical programming. This paper considers the effects of changing the coding of the input stimulus, and demonstrates that the self-supervised learning of a particular form of hidden unit representation improves performance.
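As a rough illustration of temporal difference learning on a grid, the sketch below runs tabular TD(0) evaluation of a uniformly random policy. It is not the method of the paper, which concerns input coding and hidden unit representations. The grid size, goal position, step cost of -1, learning rate, and the function name `td0_grid` are all assumptions made for the example.

```python
import random


def td0_grid(size=4, goal=(0, 0), episodes=2000, alpha=0.1, gamma=1.0, seed=0):
    """Tabular TD(0) evaluation of a random policy on a size x size grid.

    All parameters are illustrative assumptions, not values from the paper.
    """
    rng = random.Random(seed)
    moves = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right
    values = {(r, c): 0.0 for r in range(size) for c in range(size)}

    for _ in range(episodes):
        state = (rng.randrange(size), rng.randrange(size))
        while state != goal:
            dr, dc = rng.choice(moves)
            # Moves that would leave the grid are clipped, so the agent stays put.
            nxt = (
                min(max(state[0] + dr, 0), size - 1),
                min(max(state[1] + dc, 0), size - 1),
            )
            reward = -1.0  # assumed cost of taking one step
            # TD(0) update: move V(s) toward r + gamma * V(s').
            td_error = reward + gamma * values[nxt] - values[state]
            values[state] += alpha * td_error
            state = nxt
    return values


if __name__ == "__main__":
    learned = td0_grid()
    for row in range(4):
        print(" ".join(f"{learned[(row, col)]:7.1f}" for col in range(4)))
```

The goal state is never updated, so its value stays at zero, and states farther from the goal should end up with more negative estimates because a random walk needs more steps to reach the goal from them.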
Dec-31-1991
- Country:
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
- North America > United States > Massachusetts > Hampshire County > Amherst (0.14)
- Industry:
- Health & Medicine > Therapeutic Area > Neurology (0.47)
- Technology: