WhenCombinatorialThompsonSamplingmeets ApproximationRegret
–Neural Information Processing Systems
At each round t N, the agent must select one arm from a fixed set ofn arms, denoted by [n], {1,...,n}, using apolicy, based on the feedback from the previous rounds.
Neural Information Processing Systems
Feb-9-2026, 17:37:41 GMT
- Country:
- North America > United States
- California (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- North America > United States
- Technology: