Can Evolutionary Clustering Have Theoretical Guarantees?
–arXiv.org Artificial Intelligence
Clustering is a fundamental problem in many areas, which aims to partition a given data set into groups based on some distance measure, such that the data points in the same group are similar while that in different groups are dissimilar. Due to its importance and NP-hardness, a lot of methods have been proposed, among which evolutionary algorithms are a class of popular ones. Evolutionary clustering has found many successful applications, but all the results are empirical, lacking theoretical support. This paper fills this gap by proving that the approximation performance of the GSEMO (a simple multi-objective evolutionary algorithm) for solving four formulations of clustering, i.e., $k$-tMM, $k$-center, discrete $k$-median and $k$-means, can be theoretically guaranteed. Furthermore, we consider clustering under fairness, which tries to avoid algorithmic bias, and has recently been an important research topic in machine learning. We prove that for discrete $k$-median clustering under individual fairness, the approximation performance of the GSEMO can be theoretically guaranteed with respect to both the objective function and the fairness constraint.
arXiv.org Artificial Intelligence
Jul-22-2023
- Country:
- South America > Argentina
- Pampas > Buenos Aires F.D. > Buenos Aires (0.04)
- Oceania > Australia
- New South Wales > Sydney (0.04)
- North America
- United States
- New Jersey > Hudson County
- Hoboken (0.04)
- Massachusetts > Suffolk County
- Boston (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Illinois > Cook County
- Chicago (0.04)
- California > Los Angeles County
- Long Beach (0.04)
- New Jersey > Hudson County
- Mexico > Quintana Roo
- Cancún (0.04)
- Canada > British Columbia
- United States
- Europe
- United Kingdom
- Wales > Ceredigion
- Aberystwyth (0.04)
- Scotland > City of Edinburgh
- Edinburgh (0.04)
- Wales > Ceredigion
- Switzerland > Zürich
- Zürich (0.14)
- Sweden > Stockholm
- Stockholm (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Germany
- Berlin (0.04)
- North Rhine-Westphalia > Arnsberg Region
- Dortmund (0.04)
- United Kingdom
- Asia
- Macao (0.04)
- Singapore (0.04)
- China > Jiangsu Province
- Nanjing (0.04)
- South America > Argentina
- Genre:
- Research Report (0.50)
- Industry:
- Education (0.46)
- Technology: