Stable Dual Dynamic Programming

Wang, Tao; Bowling, Michael; Schuurmans, Dale; Lizotte, Daniel J.

Neural Information Processing Systems 

We recently introduced a novel approach to dynamic programming and reinforcement learning that maintains explicit representations of stationary distributions instead of value functions. In this paper, we investigate the convergence properties of these dual algorithms, both theoretically and empirically, and show how they can be scaled up by incorporating function approximation.
