Clustering with Distributed Data

Jan-1-2019–arXiv.org Machine Learning

We consider K-means clustering in networked environments (e.g., internet of things (IoT) and sensor networks) where data is inherently distributed across nodes and processing power at each node may be limited. We consider a clustering algorithm referred to as networked K-means, or N K-means, which relies only on local neighborhood information exchange. Information exchange is limited to low-dimensional statistics and not raw data at the agents. The proposed approach develops a parametric family of multi-agent clustering objectives (parameterized by ρ) and associated distributed N K-means algorithms (also parameterized by ρ). The NK-means algorithm with parameter ρ converges to a set of fixed points relative to the associated multi-agent objective (designated as generalized minima). By appropriate choice of ρ, the set of generalized minima may be brought arbitrarily close to the set of Lloyd's minima. Thus, the NK-means algorithm may be used to compute Lloyd's minima of the collective dataset up to arbitrary accuracy. Keywords: K-means clustering, Lloyd's minima, distributed algorithms, distributed machine learning, network information processing

algorithm, lloyd, minima, (16 more...)

arXiv.org Machine Learning

Jan-1-2019

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - District of Columbia > Washington (0.04)
  - Pennsylvania > Allegheny County
    - Pittsburgh (0.14)
  - New York > New York County
    - New York City (0.04)
  - New Jersey > Mercer County
    - Princeton (0.04)
  - Nevada > Clark County
    - Las Vegas (0.04)
  - Massachusetts > Middlesex County
    - Cambridge (0.04)
  - Florida > Miami-Dade County
    - Miami (0.04)
- Asia > Middle East
  - Jordan (0.04)
  - Iran > Tehran Province
    - Tehran (0.04)

Genre:
- Research Report (0.64)

Industry:
- Information Technology > Security & Privacy (0.46)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found