Diverse Sequential Subset Selection for Supervised Video Summarization