Leveraging Sparsity for Efficient Submodular Data Summarization

May-27-2025, 20:58:09 GMT – Neural Information Processing Systems

The facility location problem is widely used for summarizing large datasets and has additional applications in sensor placement, image retrieval, and clustering. One difficulty is that submodular optimization algorithms require the pairwise benefits between all items in the dataset, which is infeasible for large problems, so recent work proposed calculating only nearest-neighbor benefits. A limitation of that work is that it invoked several strong assumptions to obtain provable approximation guarantees. In this paper we establish that these extra assumptions are not necessary: solving the sparsified problem will be almost optimal under the standard assumptions of the problem.
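To make the setting in the abstract concrete, here is a minimal sketch of the facility location objective, a nearest-neighbor sparsification of the benefit matrix, and a plain greedy selection. It is an illustration under stated assumptions, not the paper's algorithm: the function names (`facility_location_value`, `sparsify_top_t`, `greedy`), the Gaussian-kernel benefits, the toy data, and the choice of `t` and `k` are all hypothetical.

```python
import math
import random


def facility_location_value(benefits, selected):
    """F(A) = sum over items i of the best benefit any selected j gives i.

    `benefits` is a list of dicts: benefits[i][j] is the benefit of
    representative j for item i; missing entries count as 0.
    """
    return sum(
        max((row.get(j, 0.0) for j in selected), default=0.0)
        for row in benefits
    )


def sparsify_top_t(dense, t):
    """Keep only the t largest benefits per item (its t nearest neighbors)."""
    sparse = []
    for row in dense:
        top = sorted(range(len(row)), key=lambda j: row[j], reverse=True)[:t]
        sparse.append({j: row[j] for j in top})
    return sparse


def greedy(benefits, n_candidates, k):
    """Standard greedy: repeatedly add the candidate with largest marginal gain."""
    selected = []
    best = [0.0] * len(benefits)  # current best benefit for each item
    for _ in range(k):
        gains = [0.0] * n_candidates
        # Only stored (nonzero) entries are visited, so cost scales with
        # the number of entries kept by the sparsification.
        for i, row in enumerate(benefits):
            for j, b in row.items():
                if b > best[i]:
                    gains[j] += b - best[i]
        chosen = set(selected)
        j_star = max(
            (j for j in range(n_candidates) if j not in chosen),
            key=lambda j: gains[j],
        )
        selected.append(j_star)
        for i, row in enumerate(benefits):
            best[i] = max(best[i], row.get(j_star, 0.0))
    return selected


# Toy data (assumption): three Gaussian clusters in the plane.
random.seed(0)
centers = [(0.0, 0.0), (5.0, 5.0), (0.0, 5.0)]
points = [
    (cx + random.gauss(0, 0.5), cy + random.gauss(0, 0.5))
    for cx, cy in centers
    for _ in range(10)
]
# Nonnegative benefits from a Gaussian kernel (assumption).
dense = [
    [math.exp(-((px - qx) ** 2 + (py - qy) ** 2)) for qx, qy in points]
    for px, py in points
]
dense_rows = [dict(enumerate(row)) for row in dense]
sparse_rows = sparsify_top_t(dense, t=5)

picked_dense = greedy(dense_rows, len(points), k=3)
picked_sparse = greedy(sparse_rows, len(points), k=3)

# Both summaries are scored on the full (dense) objective for comparison.
print("greedy on dense benefits: ", facility_location_value(dense_rows, picked_dense))
print("greedy on sparse benefits:", facility_location_value(dense_rows, picked_sparse))
```

Because sparsification only drops nonnegative entries, the sparsified objective never exceeds the dense objective for the same selected set. The sketch scores both summaries on the dense objective so that the cost of optimizing with only nearest-neighbor benefits can be inspected directly.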

artificial intelligence, efficient submodular data summarization, leveraging sparsity

Technology:
- Information Technology
  - Artificial Intelligence > Representation & Reasoning (0.71)
  - Sensing and Signal Processing (0.65)