Fast networked data selection via distributed smoothed quantile estimation
Zhang, Xu, Vasconcelos, Marcos M.
–arXiv.org Artificial Intelligence
Collecting the most informative data from a large dataset distributed over a network is a fundamental problem in many fields, including control, signal processing and machine learning. In this paper, we establish a connection between selecting the most informative data and finding the top-$k$ elements of a multiset. The top-$k$ selection in a network can be formulated as a distributed nonsmooth convex optimization problem known as quantile estimation. Unfortunately, the lack of smoothness in the local objective functions leads to extremely slow convergence and poor scalability with respect to the network size. To overcome the deficiency, we propose an accelerated method that employs smoothing techniques. Leveraging the piecewise linearity of the local objective functions in quantile estimation, we characterize the iteration complexity required to achieve top-$k$ selection, a challenging task due to the lack of strong convexity. Several numerical results are provided to validate the effectiveness of the algorithm and the correctness of the theory.
arXiv.org Artificial Intelligence
Jun-3-2024
- Country:
- Asia > China
- Beijing > Beijing (0.04)
- Shaanxi Province > Xi'an (0.04)
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- North America > United States
- California (0.14)
- Maryland > Prince George's County
- College Park (0.04)
- New York > New York County
- New York City (0.04)
- Virginia (0.04)
- Asia > China
- Genre:
- Research Report (0.81)
- Industry:
- Information Technology > Security & Privacy (0.46)
- Technology: