Towards Alignment-Centric Paradigm: A Survey of Instruction Tuning in Large Language Models
Han, Xudong, Yang, Junjie, Wang, Tianyang, Bi, Ziqian, Song, Xinyuan, Hao, Junfeng, Song, Junhao
–arXiv.org Artificial Intelligence
Instruction tuning is a pivotal technique for aligning large language models (LLMs) with human intentions, safety constraints, and domain-specific requirements. This survey provides a comprehensive overview of the full pipeline, encompassing (i) data collection methodologies, (ii) full-parameter and parameter-efficient fine-tuning strategies, and (iii) evaluation protocols. We categorized data construction into three major paradigms: expert annotation, distillation from larger models, and self-improvement mechanisms, each offering distinct trade-offs between quality, scalability, and resource cost. Fine-tuning techniques range from conventional supervised training to lightweight approaches, such as low-rank adaptation (LoRA) and prefix tuning, with a focus on computational efficiency and model reusability. We further examine the challenges of evaluating faithfulness, utility, and safety across multilingual and multimodal scenarios, highlighting the emergence of domain-specific benchmarks in healthcare, legal, and financial applications. Finally, we discuss promising directions for automated data generation, adaptive optimization, and robust evaluation frameworks, arguing that a closer integration of data, algorithms, and human feedback is essential for advancing instruction-tuned LLMs. This survey aims to serve as a practical reference for researchers and practitioners seeking to design LLMs that are both effective and reliably aligned with human intentions.
arXiv.org Artificial Intelligence
Nov-20-2025
- Country:
- Europe > United Kingdom (0.28)
- Genre:
- Overview (1.00)
- Research Report > New Finding (0.92)
- Industry:
- Information Technology (1.00)
- Education (1.00)
- Health & Medicine > Therapeutic Area
- Endocrinology > Diabetes (0.67)
- Technology: