Extreme Bandits using Robust Statistics
Bhatt, Sujay, Li, Ping, Samorodnitsky, Gennady
We consider a multi-armed bandit problem motivated by situations where only the extreme values, as opposed to expected values in the classical bandit setting, are of interest. We propose distribution free algorithms using robust statistics and characterize the statistical properties. We show that the provided algorithms achieve vanishing extremal regret under weaker conditions than existing algorithms. Performance of the algorithms is demonstrated for the finite-sample setting using numerical experiments. The results show superior performance of the proposed algorithms compared to the well known algorithms.
Sep-9-2021
- Country:
- Asia
- India (0.04)
- Middle East > Israel
- Haifa District > Haifa (0.04)
- Europe
- France > Pays de la Loire
- Loire-Atlantique > Nantes (0.04)
- North Macedonia > Skopje Statistical Region
- Skopje Municipality > Skopje (0.04)
- Spain
- Andalusia > Cádiz Province
- Cadiz (0.04)
- Catalonia > Barcelona Province
- Barcelona (0.04)
- Andalusia > Cádiz Province
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- France > Pays de la Loire
- North America
- Canada > Quebec
- Montreal (0.04)
- United States
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Massachusetts > Suffolk County
- Boston (0.04)
- New York
- Bronx County > New York City (0.04)
- Kings County > New York City (0.04)
- New York County > New York City (0.14)
- Queens County > New York City (0.04)
- Richmond County > New York City (0.04)
- Tompkins County > Ithaca (0.04)
- Pennsylvania > Allegheny County
- Pittsburgh (0.04)
- Washington > King County
- Bellevue (0.04)
- Louisiana > Orleans Parish
- Canada > Quebec
- South America > Chile
- Asia
- Genre:
- Research Report (0.70)
- Technology: