Accelerating Value Iteration with Anchoring

Neural Information Processing Systems 

In this paper, we present the first accelerated VI for both the Bellman consistency and optimality operators.