Provably sample-efficient RL with side information about latent dynamics