Optimistic Posterior Sampling for Reinforcement Learning with Few Samples and Tight Guarantees
Neural Information Processing Systems
Specifically, we extend the normal approximation-based lower bound for Beta distributions by Alfers and Dinges [1984] to Dirichlet distributions.
Nov-14-2025, 03:31:44 GMT
- Country:
  - Asia > Middle East
    - Jordan (0.04)
  - Europe
    - Russia > Central Federal District
      - Moscow Oblast > Moscow (0.04)
    - United Kingdom > Scotland
      - City of Edinburgh > Edinburgh (0.04)
  - North America > United States
    - Arizona > Maricopa County
      - Scottsdale (0.04)
    - Massachusetts > Norfolk County
      - Wellesley (0.04)
    - Virginia > Arlington County
      - Arlington (0.04)
- Genre:
  - Research Report (0.46)
  - Workflow (0.46)
- Technology: