Nearest-Neighbor Sample Compression: Efficiency, Consistency, Infinite Dimensions
Aryeh Kontorovich, Sivan Sabato, Roi Weiss
Neural Information Processing Systems
We examine the Bayes-consistency of a recently proposed 1-nearest-neighbor-based multiclass learning algorithm. This algorithm is derived from sample compression bounds and enjoys the statistical advantages of tight, fully empirical generalization bounds, as well as the algorithmic advantages of faster runtime and reduced memory. We prove that this algorithm is strongly Bayes-consistent in metric spaces with finite doubling dimension; this is the first consistency result for an efficient nearest-neighbor sample compression scheme. Rather surprisingly, we discover that the algorithm remains Bayes-consistent even in a certain infinite-dimensional setting, in which the basic measure-theoretic conditions on which classic consistency proofs hinge are violated. This is all the more surprising given that k-NN is known not to be Bayes-consistent in that setting. We conclude by posing several challenging open problems for future research.
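For intuition, here is a minimal sketch of the general compression idea behind such schemes: replace the sample with a greedy gamma-net and classify new points by 1-NN over the net points only. This is an illustrative sketch, not the authors' exact procedure; in this sketch each net point simply keeps its own label, the metric is assumed Euclidean, and the scale gamma is a free parameter, whereas the paper selects the compressed set using its sample compression bound. All function names below are hypothetical.

```python
import numpy as np

def greedy_gamma_net(X, gamma):
    """Greedily select indices of a gamma-net: every point of X ends up
    within distance gamma of some selected center (Euclidean metric assumed)."""
    centers = []
    for i in range(len(X)):
        # Keep point i only if it is far from all centers chosen so far.
        if all(np.linalg.norm(X[i] - X[j]) > gamma for j in centers):
            centers.append(i)
    return centers

def compress_1nn(X, y, gamma):
    """Compress (X, y) to a gamma-net; each retained point keeps its own label.
    (The paper's algorithm chooses the compressed set more carefully.)"""
    idx = greedy_gamma_net(X, gamma)
    return X[idx], y[idx]

def predict_1nn(Xc, yc, queries):
    """1-nearest-neighbor prediction over the compressed sample."""
    preds = []
    for q in queries:
        dists = np.linalg.norm(Xc - q, axis=1)
        preds.append(yc[np.argmin(dists)])
    return np.array(preds)

# Toy usage: compress a linearly separable sample, then classify with 1-NN.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2))
y = (X[:, 0] + X[:, 1] > 0).astype(int)
Xc, yc = compress_1nn(X, y, gamma=0.5)
print(f"kept {len(Xc)} of {len(X)} points")
print(predict_1nn(Xc, yc, X[:5]), y[:5])
```

The runtime and memory savings mentioned in the abstract come from the fact that prediction touches only the compressed set, whose size is governed by the net scale and the doubling dimension rather than by the full sample size.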