Can Models Learn Skill Composition from Examples?
Neural Information Processing Systems
As large language models (LLMs) become increasingly advanced, their ability to exhibit compositional generalization (the capacity to combine learned skills in novel ways not encountered during training) has garnered significant attention. This type of generalization, particularly in scenarios beyond the training data, is also of great interest in the study of AI safety and alignment. A recent study introduced the Skill-Mix evaluation, in which models are tasked with composing a short paragraph demonstrating the use of a specified k-tuple of language skills. While small models struggled to compose even with k = 3, larger models like GPT-4 performed reasonably well with k = 5 and 6. In this paper, we employ a setup akin to Skill-Mix to evaluate the capacity of smaller models to learn compositional generalization from examples. Utilizing a diverse set of language skills (including rhetorical, literary, reasoning, theory of mind, and common sense skills), GPT-4 was used to generate text samples that exhibit random subsets of k skills.
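For readers unfamiliar with the setup, the sampling step described above might look roughly like the sketch below. The skill list, topic, and prompt wording are illustrative assumptions for exposition, not the paper's exact skill taxonomy or templates.

```python
import random

# A minimal sketch of a Skill-Mix-style generation prompt, assuming a
# hypothetical skill list; the paper's actual skills and wording differ.
SKILLS = [
    "metaphor",           # literary
    "red herring",        # rhetorical
    "modus ponens",       # reasoning
    "false belief",       # theory of mind
    "object permanence",  # common sense
    "irony",
    "analogy",
]

def skill_mix_prompt(k: int, topic: str = "gardening") -> str:
    """Sample a random k-tuple of skills and ask for a short paragraph
    that demonstrates all of them at once."""
    chosen = random.sample(SKILLS, k)
    return (
        f"Write a short paragraph about {topic} that naturally "
        f"demonstrates all of the following skills: {', '.join(chosen)}."
    )

if __name__ == "__main__":
    print(skill_mix_prompt(k=3))
```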