Fast Approximate Dynamic Programming for Infinite-Horizon Markov Decision Processes

Neural Information Processing Systems 

In this study, we consider the infinite-horizon, discounted cost, optimal control of stochastic nonlinear systems with separable cost and constraints in the state and input variables.