Provably Sample-Efficient RL with Side Information about Latent Dynamics

Open in new window