Reinforcement Learning under Latent Dynamics: Toward Statistical and Algorithmic Modularity

Neural Information Processing Systems 

Real-world applications of reinforcement learning often involve environments where agents operate on complex, high-dimensional observations, but the underlying ("latent") dynamics are comparatively simple.