An objective function for order preserving hierarchical clustering
–arXiv.org Artificial Intelligence
We present a theory and an objective function for similarity-based hierarchical clustering of probabilistic partial orders and directed acyclic graphs (DAGs). Specifically, given elements $x \le y$ in the partial order, and their respective clusters $[x]$ and $[y]$, the theory yields an order relation $\le'$ on the clusters such that $[x]\le'[y]$. The theory provides a concise definition of order-preserving hierarchical clustering, and offers a classification theorem identifying the order-preserving trees (dendrograms). To determine the optimal order-preserving trees, we develop an objective function that frames the problem as a bi-objective optimisation, aiming to satisfy both the order relation and the similarity measure. We prove that the optimal trees under the objective are both order-preserving and exhibit high-quality hierarchical clustering. Since finding an optimal solution is NP-hard, we introduce a polynomial-time approximation algorithm and demonstrate that the method outperforms existing methods for order-preserving hierarchical clustering by a significant margin.
arXiv.org Artificial Intelligence
Dec-10-2024
- Country:
- Asia
- Afghanistan > Parwan Province
- Charikar (0.04)
- Middle East > Israel (0.04)
- Afghanistan > Parwan Province
- Europe > Norway
- Eastern Norway > Oslo (0.04)
- North America > United States
- Arizona (0.04)
- California (0.04)
- Idaho (0.04)
- Nevada (0.04)
- New York > New York County
- New York City (0.04)
- Oregon (0.04)
- Utah (0.04)
- South America > Chile
- Asia
- Genre:
- Research Report (0.40)
- Industry:
- Government > Regional Government (0.46)
- Technology: