TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data
Zhang, Jipeng, Qin, Yaxuan, Pi, Renjie, Zhang, Weizhong, Pan, Rui, Zhang, Tong
–arXiv.org Artificial Intelligence
Instruction tuning [Wei et al., 2022a, Ouyang et al., 2022] is the most important strategy for customizing Large Language Models (LLMs) for downstream tasks, which allows them to precisely understand human intentions and accurately generate responses in natural languages. Recently, many existing works Wang et al. [2023a] expand the amount and diversity of instructions for instruction tuning to further enhance the LLM's capability. However, the increased quantity of the dataset also leads to significantly higher computational costs for instruction tuning. Meanwhile, Zhou et al. [2023] revealed that only 1,000 high-quality, human-created data samples could substantially improve the ability of LLMs to follow instructions, which suggest that there exists severe redundancy in current instruction datasets, and only a high-quality subset may suffice for achieving promising performance. To address the above issue, selecting a small, highly informative subset (i.e., coreset) of training samples from the original dataset is a promising solution. This approach ensures that training on the coreset achieves performance comparable to the full dataset while significantly reducing costs. However, coreset selection is challenging as it must not only consider the quality of individual samples, but also their importance within the entire subset. For example, if two high-quality samples are very similar, selecting only one may be sufficient. This global perspective on sample importance is crucial for the quality of the selected subset.
arXiv.org Artificial Intelligence
Jul-21-2024
- Country:
- North America
- United States
- Louisiana > Orleans Parish
- New Orleans (0.05)
- Illinois > Champaign County
- Urbana (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- California > San Diego County
- San Diego (0.04)
- Louisiana > Orleans Parish
- Canada > Ontario
- Toronto (0.04)
- United States
- Asia
- China > Hong Kong (0.04)
- Middle East
- Jordan (0.04)
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.04)
- North America
- Genre:
- Research Report > New Finding (0.46)
- Technology: