Nearly Tight Bounds for the Continuum-Armed Bandit Problem
–Neural Information Processing Systems
In the multi-armed bandit problem, an online algorithm must choose from a set of strategies in a sequence of n trials so as to minimize the total cost of the chosen strategies. While nearly tight upper and lower bounds are known in the case when the strategy set is finite, much less is known when there is an infinite strategy set.
Neural Information Processing Systems
Dec-31-2005
- Country:
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
- Technology: