Replicability in Reinforcement Learning

Neural Information Processing Systems 

We initiate the mathematical study of replicability as an algorithmic property in the context of reinforcement learning (RL). We focus on the fundamental setting of discounted tabular MDPs with access to a generative model.