Efficient Constrained $k$-Center Clustering with Background Knowledge
Guo, Longkun, Jia, Chaoqi, Liao, Kewen, Lu, Zhigang, Xue, Minhui
–arXiv.org Artificial Intelligence
Center-based clustering has attracted significant research interest from both theory and practice. In many practical applications, input data often contain background knowledge that can be used to improve clustering results. In this work, we build on widely adopted $k$-center clustering and model its input background knowledge as must-link (ML) and cannot-link (CL) constraint sets. However, most clustering problems including $k$-center are inherently $\mathcal{NP}$-hard, while the more complex constrained variants are known to suffer severer approximation and computation barriers that significantly limit their applicability. By employing a suite of techniques including reverse dominating sets, linear programming (LP) integral polyhedron, and LP duality, we arrive at the first efficient approximation algorithm for constrained $k$-center with the best possible ratio of 2. We also construct competitive baseline algorithms and empirically evaluate our approximation algorithm against them on a variety of real datasets. The results validate our theoretical findings and demonstrate the great advantages of our algorithm in terms of clustering cost, clustering quality, and running time.
arXiv.org Artificial Intelligence
Jan-23-2024
- Country:
- Asia
- Afghanistan > Parwan Province
- Charikar (0.04)
- China
- Beijing > Beijing (0.04)
- Fujian Province > Fuzhou (0.04)
- Japan (0.04)
- Middle East > Jordan (0.04)
- Afghanistan > Parwan Province
- Europe
- Finland > Uusimaa
- Helsinki (0.04)
- France
- Auvergne-Rhône-Alpes > Lyon
- Lyon (0.04)
- Grand Est > Meurthe-et-Moselle
- Nancy (0.04)
- Auvergne-Rhône-Alpes > Lyon
- Finland > Uusimaa
- North America
- Canada
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.14)
- Ontario > National Capital Region
- Ottawa (0.04)
- Quebec > Montreal (0.04)
- British Columbia > Metro Vancouver Regional District
- United States
- California
- Orange County > Newport Beach (0.04)
- Santa Clara County > Stanford (0.04)
- Florida > Orange County
- Orlando (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- Maryland > Montgomery County
- Bethesda (0.04)
- New York > New York County
- New York City (0.04)
- California
- Canada
- Oceania
- Australia (0.04)
- New Zealand > North Island
- Waikato (0.04)
- Asia
- Genre:
- Research Report (1.00)
- Industry:
- Information Technology > Security & Privacy (1.00)
- Technology: