Scalar Posterior Sampling with Applications
Georgios Theocharous, Zheng Wen, Yasin Abbasi Yadkori, Nikos Vlassis
–Neural Information Processing Systems
Our algorithm termed deterministic schedule PSRL (DS-PSRL) is efficient in terms of time, sample, and space complexity. We prove a Bayesian regret bound under mild assumptions. Our result is more generally applicable to multiple parameters and continuous state action problems. We compare our algorithm with state-of-the-art PSRL algorithms on standard discrete and continuous problems from the literature.
Neural Information Processing Systems
Nov-20-2025, 19:46:42 GMT
- Country:
- North America
- Canada > Quebec
- Montreal (0.04)
- United States > Massachusetts
- Middlesex County > Belmont (0.04)
- Canada > Quebec
- North America
- Genre:
- Research Report > New Finding (0.48)