Nearly-Linear Time and Massively Parallel Algorithms for k-Anonymity

Jun-22-2026, 20:18:12 GMT–Neural Information Processing Systems

Previous algorithms with provable guarantees either (1) achieve the same O(k)approximation ratio but require at least O(n2k) runtime, or (2) provide a better O(logk) approximation ratio at the cost of an impractical O(n2k) worst-case runtime for general d and k. Our algorithm extends to the Massively Parallel Computation (MPC) model, where it gives an MPC algorithm requiring eO(log1+ε n) rounds and total space O(n1+γ(d+k)). Empirically, we also demonstrate that our algorithmic ideas can be adapted to existing heuristic methods, leading to significant speed-ups while preserving comparable performance. On the hardness side, we study the related single-point k-anonymity problem, where the goal is to select k 1 additional records to make a given record indistinguishable. Assuming the dense vs random conjecture in complexity theory, we show that for n = kc, no algorithm can achieve a k1 O(1/c) approximation in poly(n) time, providing evidence for the inherent hardness of the k-anonymity problem.

data mining, machine learning, natural language, (21 more...)

Neural Information Processing Systems

Jun-22-2026, 20:18:12 GMT

Conferences PDF

Add feedback

Country:
- Asia (0.93)
- North America > United States (0.68)

Genre:
- Research Report > Experimental Study (1.00)

Industry:
- Information Technology > Security & Privacy (0.46)

Technology:
- Information Technology
  - Data Science > Data Mining (1.00)
  - Architecture > Distributed Systems (0.64)
  - Artificial Intelligence
    - Representation & Reasoning (1.00)
    - Natural Language (0.93)
    - Machine Learning > Statistical Learning (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found