Data Summarization at Scale: A Two-Stage Submodular Approach