On-Line Estimation of the Optimal Value Function: HJB-Estimators
–Neural Information Processing Systems
In this paper, we discuss online estimation strategies that model the optimal value function of a typical optimal control problem. We present a general strategy in which local corridor solutions, obtained via dynamic programming, supply locally optimal control sequences as training data for a neural architecture that models the optimal value function.
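The strategy described above can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: a one-dimensional system x' = x + dt*u, a quadratic stage and terminal cost, a fixed grid of states standing in for the "corridor", and a random-feature tanh network with a least-squares output layer standing in for the neural architecture. The names `corridor_dp` and `fit_value_model` are hypothetical.

```python
import numpy as np

def corridor_dp(x_grid, u_grid, dt=0.1, steps=20):
    """Backward dynamic programming restricted to the states in x_grid.

    Assumed toy problem: x' = x + dt*u, stage cost dt*(x^2 + u^2),
    terminal cost x^2. Returns the cost-to-go at each grid state.
    """
    V = x_grid ** 2  # assumed terminal cost
    for _ in range(steps):
        # Next state for every (state, control) pair, clipped to the corridor.
        x_next = np.clip(x_grid[:, None] + dt * u_grid[None, :],
                         x_grid[0], x_grid[-1])
        stage = dt * (x_grid[:, None] ** 2 + u_grid[None, :] ** 2)
        # Interpolate the current cost-to-go at the next states.
        V_next = np.interp(x_next.ravel(), x_grid, V).reshape(x_next.shape)
        V = np.min(stage + V_next, axis=1)
    return V

def fit_value_model(x, v, hidden=30, seed=0):
    """Fit a one-hidden-layer tanh model to (state, value) samples.

    Simplification: hidden weights are random and fixed; only the linear
    output layer is fitted, by least squares.
    """
    rng = np.random.default_rng(seed)
    W = rng.normal(scale=2.0, size=hidden)
    b = rng.normal(scale=1.0, size=hidden)

    def features(xq):
        xq = np.atleast_1d(np.asarray(xq, dtype=float))
        return np.column_stack([np.tanh(xq[:, None] * W + b),
                                np.ones_like(xq)])

    w_out, *_ = np.linalg.lstsq(features(x), v, rcond=None)
    return lambda xq: features(xq) @ w_out

# Training data from the local dynamic-programming solution.
x_grid = np.linspace(-1.0, 1.0, 41)   # corridor states (assumed)
u_grid = np.linspace(-1.0, 1.0, 21)   # admissible controls (assumed)
V = corridor_dp(x_grid, u_grid)

# Model of the value function fitted to that data.
model = fit_value_model(x_grid, V)
```

In an online setting one would presumably repeat the two steps as the corridor moves through the state space, refitting or updating the model with each new batch of samples; the sketch shows a single pass only.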
Dec-31-1993