PG-TS: Improved Thompson Sampling for Logistic Contextual Bandits
Bianca Dumitrascu, Karen Feng, Barbara Engelhardt
–Neural Information Processing Systems
A contextual bandit is an online learning framework for modeling sequential decision-making problems.
Neural Information Processing Systems
Nov-20-2025, 20:09:07 GMT
- Country:
- North America
- Canada > Quebec
- Montreal (0.04)
- United States > New Jersey
- Mercer County > Princeton (0.04)
- Canada > Quebec
- North America
- Genre:
- Research Report (0.70)
- Industry:
- Education (0.34)
- Health & Medicine (0.47)