Thompson Sampling is Asymptotically Optimal in General Environments
