Robust Anytime Learning of Markov Decision Processes

Neural Information Processing Systems

Markov decision processes (MDPs) are formal models commonly used in sequential decision-making.