Stabilizing Value Function Approximation with the BFBP Algorithm
Wang, Xin, Dietterich, Thomas G.
–Neural Information Processing Systems
However, online RL algorithms such as SARSA(A) have been shown experimentally to have difficulty converging when applied with function approximators. Theoretical analysis has not been able to prove convergence, even in the case-of linear function approximators.
Neural Information Processing Systems
Dec-31-2002
- Country:
- North America > United States
- California > San Francisco County
- San Francisco (0.15)
- Oregon (0.14)
- California > San Francisco County
- North America > United States
- Technology: