Operator Splitting Value Iteration

Neural Information Processing Systems 

OS-VI achieves a much faster convergence rate when the model is accurate enough.