Diversity Measurement and Subset Selection for Instruction Tuning Datasets