Distributed Submodular Cover: Succinctly Summarizing Massive Data

Neural Information Processing Systems 

How can one find a subset, ideally as small as possible, that well represents a massive dataset? I.e., its corresponding utility, measured according to a suitable utility function, should be comparable to that of the whole dataset.