Practical Issues in Temporal Difference Learning
–Neural Information Processing Systems
TO('\) is an algorithm for adjusting the weights in a connectionist network which 259 260 Tesauro has the following form:
Neural Information Processing Systems
Dec-31-1992
- Country:
- Europe > Netherlands > North Holland > Amsterdam (0.04)
- Industry:
- Leisure & Entertainment > Games > Backgammon (0.49)
- Technology: