Exploration with Limited Memory: Streaming Algorithms for Coin Tossing, Noisy Comparisons, and Multi-Armed Bandits
–arXiv.org Artificial Intelligence
Consider the following abstract coin tossing problem: Given a set of $n$ coins with unknown biases, find the most biased coin using a minimal number of coin tosses. This is a common abstraction of various exploration problems in theoretical computer science and machine learning and has been studied extensively over the years. In particular, algorithms with optimal sample complexity (number of coin tosses) have been known for this problem for quite some time. Motivated by applications to processing massive datasets, we study the space complexity of solving this problem with optimal number of coin tosses in the streaming model. In this model, the coins are arriving one by one and the algorithm is only allowed to store a limited number of coins at any point -- any coin not present in the memory is lost and can no longer be tossed or compared to arriving coins. Prior algorithms for the coin tossing problem with optimal sample complexity are based on iterative elimination of coins which inherently require storing all the coins, leading to memory-inefficient streaming algorithms. We remedy this state-of-affairs by presenting a series of improved streaming algorithms for this problem: we start with a simple algorithm which require storing only $O(\log{n})$ coins and then iteratively refine it further and further, leading to algorithms with $O(\log\log{(n)})$ memory, $O(\log^*{(n)})$ memory, and finally a one that only stores a single extra coin in memory -- the same exact space needed to just store the best coin throughout the stream. Furthermore, we extend our algorithms to the problem of finding the $k$ most biased coins as well as other exploration problems such as finding top-$k$ elements using noisy comparisons or finding an $\epsilon$-best arm in stochastic multi-armed bandits, and obtain efficient streaming algorithms for these problems.
arXiv.org Artificial Intelligence
Dec-26-2022
- Country:
- Oceania > Australia
- New South Wales > Sydney (0.04)
- North America
- United States
- Nevada (0.04)
- District of Columbia > Washington (0.04)
- New York > New York County
- New York City (0.04)
- New Jersey > Middlesex County
- New Brunswick (0.04)
- Massachusetts
- Middlesex County > Cambridge (0.04)
- Hampshire County > Amherst (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- California > Los Angeles County
- Long Beach (0.14)
- Redondo Beach (0.04)
- Puerto Rico > San Juan
- San Juan (0.04)
- Canada
- Quebec > Montreal (0.04)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- United States
- Europe
- United Kingdom
- Scotland > City of Edinburgh
- Edinburgh (0.04)
- England
- Greater London > London (0.04)
- Cambridgeshire > Cambridge (0.04)
- Scotland > City of Edinburgh
- Spain
- Canary Islands (0.04)
- Catalonia > Barcelona Province
- Barcelona (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- Italy > Lazio
- Rome (0.04)
- France > Hauts-de-France
- United Kingdom
- Asia
- Middle East > Israel
- Haifa District > Haifa (0.04)
- China
- Middle East > Israel
- Oceania > Australia
- Genre:
- Research Report (0.82)
- Technology:
- Information Technology
- Communications (1.00)
- Artificial Intelligence > Machine Learning (1.00)
- Data Science > Data Mining
- Big Data (0.85)
- Information Technology