Thompson Sampling Efficiently Learns to Control Diffusion Processes

Neural Information Processing Systems

We expect this technique to be of broader interest.