Bandit-Based Monte Carlo Optimization for Nearest Neighbors

Bagaria, Vivek, Baharav, Tavor Z., Kamath, Govinda M., Tse, David N.

Dec-22-2020–arXiv.org Machine Learning

The celebrated Monte Carlo method estimates an expensive-to-compute quantity by random sampling. Bandit-based Monte Carlo optimization is a general technique for computing the minimum of many such expensive-to-compute quantities by adaptive random sampling. The technique converts an optimization problem into a statistical estimation problem which is then solved via multi-armed bandits. We apply this technique to solve the problem of high-dimensional k-nearest neighbors, developing an algorithm which we prove is able to identify exact nearest neighbors with high probability. We show that under regularity assumptions on a dataset of n points in d-dimensional space, the complexity of our algorithm scales logarithmically with the dimension of the data as $O((n+d)\log^2 (\frac{nd}{\delta}))$ for error probability $\delta$, rather than linearly as in exact computation requiring O(nd). We corroborate our theoretical results with numerical simulations, showing that our algorithm outperforms both exact computation and state-of-the-art algorithms such as kGraph, NGT, and LSH on real datasets.

algorithm, computation, neighbor, (15 more...)

arXiv.org Machine Learning

Dec-22-2020

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - California > Santa Clara County > Palo Alto (0.04)
- Asia > Afghanistan
  - Parwan Province > Charikar (0.04)

Genre:
- Research Report (1.00)

Industry:
- Health & Medicine > Pharmaceuticals & Biotechnology (0.69)

Technology:
- Information Technology
  - Data Science > Data Mining
    - Big Data (0.89)
  - Artificial Intelligence
    - Representation & Reasoning (1.00)
    - Machine Learning > Statistical Learning
      - Nearest Neighbor Methods (0.71)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found