Stabilizing Value Function Approximation with the BFBP Algorithm

Xin Wang, Thomas G. Dietterich

Neural Information Processing Systems 

However, online reinforcement learning (RL) algorithms such as SARSA(λ) have been shown experimentally to have difficulty converging when applied with function approximators. Theoretical analysis has not been able to prove convergence, even in the case of linear function approximators.
