PAC Bounds for Imitation and Model-based Batch Learning of Contextual Markov Decision Processes

Open in new window