Thompson Sampling and Approximate Inference
–Neural Information Processing Systems
We study the effects of approximate inference on the performance of Thompson sampling in the $k$-armed bandit problems. Thompson sampling is a successful algorithm for online decision-making but requires posterior inference, which often must be approximated in practice.
Neural Information Processing Systems
Dec-26-2025, 03:31:56 GMT
- Technology: