A Survey on Data Selection for LLM Instruction Tuning