Replicability in Reinforcement Learning
–Neural Information Processing Systems
We initiate the mathematical study of replicability as an algorithmic property in the context of reinforcement learning (RL). We focus on the fundamental setting of discounted tabular MDPs with access to a generative model. Inspired by Impagliazzo et al. [2022], we say that an RL algorithm is replicable if, with high probability, it outputs the exact same policy after two executions on i.i.d.
Neural Information Processing Systems
Dec-27-2025, 03:24:29 GMT
- Technology: