97fe251c25b6f99a2a23b330a75b11d4-Paper-Conference.pdf

Neural Information Processing Systems 

Despite the effectiveness of data selection for pretraining and instruction fine-tuning large language models (LLMs), improving data efficiency in supervised fine-tuning (SFT) for specialized domains poses significant challenges due to the complexity of fine-tuning data.