Navigating through Temporal Difference
–Neural Information Processing Systems
Barto, Sutton and Watkins [2] introduced a grid task as a didactic example of temporal difference planning and asynchronous dynamical pre gramming. This paper considers the effects of changing the coding of the input stimulus, and demonstrates that the self-supervised learning of a particular form of hidden unit representation improves performance.
Neural Information Processing Systems
Dec-31-1991
- Country:
- North America > United States
- Massachusetts
- Hampshire County > Amherst (0.14)
- Middlesex County > Cambridge (0.04)
- California > San Mateo County
- San Mateo (0.05)
- Massachusetts
- Europe > United Kingdom
- England
- Oxfordshire > Oxford (0.14)
- Cambridgeshire > Cambridge (0.14)
- England
- North America > United States
- Industry:
- Health & Medicine > Therapeutic Area > Neurology (0.47)
- Technology: