When Does Bottom-up Beat Top-down in Hierarchical Community Detection?
Dreveton, Maximilien, Kuroda, Daichi, Grossglauser, Matthias, Thiran, Patrick
–arXiv.org Artificial Intelligence
Hierarchical clustering of networks consists in finding a tree of communities, such that lower levels of the hierarchy reveal finer-grained community structures. There are two main classes of algorithms tackling this problem. Divisive ($\textit{top-down}$) algorithms recursively partition the nodes into two communities, until a stopping rule indicates that no further split is needed. In contrast, agglomerative ($\textit{bottom-up}$) algorithms first identify the smallest community structure and then repeatedly merge the communities using a $\textit{linkage}$ method. In this article, we establish theoretical guarantees for the recovery of the hierarchical tree and community structure of a Hierarchical Stochastic Block Model by a bottom-up algorithm. We also establish that this bottom-up algorithm attains the information-theoretic threshold for exact recovery at intermediate levels of the hierarchy. Notably, these recovery conditions are less restrictive compared to those existing for top-down algorithms. This shows that bottom-up algorithms extend the feasible region for achieving exact recovery at intermediate levels. Numerical experiments on both synthetic and real data sets confirm the superiority of bottom-up algorithms over top-down algorithms. We also observe that top-down algorithms can produce dendrograms with inversions. These findings contribute to a better understanding of hierarchical clustering techniques and their applications in network analysis.
arXiv.org Artificial Intelligence
Jun-1-2023
- Country:
- Africa (0.04)
- North America
- Cuba (0.04)
- United States > New York
- New York County > New York City (0.04)
- Europe
- United Kingdom > England
- Oxfordshire > Oxford (0.04)
- Switzerland > Vaud
- Lausanne (0.04)
- Netherlands > South Holland
- Delft (0.04)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- United Kingdom > England
- Asia
- North Korea (0.04)
- China (0.04)
- India (0.04)
- Genre:
- Research Report (0.63)
- Workflow (0.46)
- Industry:
- Education > Educational Setting (0.46)
- Leisure & Entertainment > Sports (0.46)
- Technology: