Community Detection Guarantees using Embeddings Learned by Node2Vec
–Neural Information Processing Systems
Embedding the nodes of a large network into an Euclidean space is a common objective in modern machine learning, with a variety of tools available. These embeddings can then be used as features for tasks such as community detection/node clustering or link prediction, where they achieve state of the art performance. With the exception of spectral clustering methods, there is little theoretical understanding for commonly used approaches to learning embeddings. In this work we examine the theoretical properties of the embeddings learned by node2vec. Our main result shows that the use of k-means clustering on the embedding vectors produced by node2vec gives weakly consistent community recovery for the nodes in (degree corrected) stochastic block models. We demonstrate this result empirically for both real and simulated networks, and examine how this relates to other embedding tools and machine learning procedures for network data.
Neural Information Processing Systems
May-28-2025, 06:33:02 GMT
- Country:
- North America
- Canada > British Columbia (0.14)
- United States (0.93)
- North America
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (0.66)
- Research Report
- Industry:
- Government (0.46)
- Information Technology (0.66)
- Technology: