Practical Issues in Temporal Difference Learning
–Neural Information Processing Systems
TO('\) is an algorithm for adjusting the weights in a connectionist network which 259 260 Tesauro has the following form:
Neural Information Processing Systems
Dec-31-1992