Provably sample-efficient RL with side information about latent dynamics
