Consistent recovery threshold of hidden nearest neighbor graphs
Ding, Jian, Wu, Yihong, Xu, Jiaming, Yang, Dana
Jian Ding, Yihong Wu, Jiaming Xu, and Dana Yang November 20, 2019 Abstract Motivated by applications such as discovering strong ties in social networks and assembling genome subsequences in biology, we study the problem of recovering a hidden 2 k -nearest neighbor (NN) graph in an n -vertex complete graph, whose edge weights are independent and distributed according to P n for edges in the hidden 2 k -NN graph and Q n otherwise. We focus on two types of asymptotic recovery guarantees as n: (1) exact recovery: all edges are classified correctly with probability tending to one; (2) almost exact recovery: the expected number of misclassified edges is o (nk). We show that the maximum likelihood estimator achieves (1) exact recovery for 2 k n o(1) if lim inf 2α n log n 1; (2) almost exact recovery for 1 k o null log n log log nnull if lim inf kD ( P n Q n) log n 1, where α n null 2 log null dP ndQ n is the R enyi divergence of order 1 2 and D (P n Q n) is the Kullback-Leibler divergence.
Nov-18-2019
- Country:
- North America > United States
- Pennsylvania > Philadelphia County
- Philadelphia (0.04)
- North Carolina > Durham County
- Durham (0.04)
- New York > New York County
- New York City (0.04)
- Pennsylvania > Philadelphia County
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- North America > United States
- Genre:
- Research Report (0.50)
- Industry:
- Health & Medicine (0.93)
- Information Technology (0.66)
- Technology: