N$^2$: A Unified Python Package and Test Bench for Nearest Neighbor-Based Matrix Completion
Chin, Caleb, Khubchandani, Aashish, Maskara, Harshvardhan, Choi, Kyuseong, Feitelberg, Jacob, Gong, Albert, Paul, Manit, Sadhukhan, Tathagata, Agarwal, Anish, Dwivedi, Raaz
Nearest neighbor (NN) methods have re-emerged as competitive tools for matrix completion, offering strong empirical performance and recent theoretical guarantees, including entry-wise error bounds, confidence intervals, and minimax optimality. Despite their simplicity, recent work has shown that NN approaches are robust to a range of missingness patterns and effective across diverse applications. This paper introduces N$^2$, a unified Python package and testbed that consolidates a broad class of NN-based methods through a modular, extensible interface. Built for both researchers and practitioners, N$^2$ supports rapid experimentation and benchmarking. Using this framework, we introduce a new NN variant that achieves state-of-the-art results in several settings. We also release a benchmark suite of real-world datasets, from healthcare and recommender systems to causal inference and LLM evaluation, designed to stress-test matrix completion methods beyond synthetic scenarios. Our experiments demonstrate that while classical methods excel on idealized data, NN-based techniques consistently outperform them in real-world settings.
Jun-5-2025
- Country:
- Europe > Germany
- Rhineland-Palatinate > Kaiserslautern (0.04)
- North America > United States
- California (0.05)
- Colorado (0.04)
- Connecticut (0.04)
- Montana (0.04)
- Nevada (0.04)
- Pennsylvania (0.04)
- Utah (0.04)
- Europe > Germany
- Genre:
- Research Report > Experimental Study (0.46)
- Industry:
- Health & Medicine
- Consumer Health (0.46)
- Public Health (0.46)
- Health & Medicine