Thompson Sampling with Information Relaxation Penalties

Open in new window