Functionality learning through specification instructions
de Araujo, Pedro Henrique Luz, Roth, Benjamin
–arXiv.org Artificial Intelligence
Test suites assess natural language processing models' performance on specific functionalities: cases of interest involving model robustness, fairness, or particular linguistic capabilities. They enable fine-grained evaluations of model aspects that would otherwise go unnoticed in standard evaluation datasets, but they do not address the problem of how to fix the failure cases. Previous work has explored functionality learning by fine-tuning models on suite data. While this improves performance on seen functionalities, it often does not generalize to unseen ones and can harm general performance. This paper analyses a fine-tuning-free approach to functionality learning. For each functionality in a suite, we generate a specification instruction that encodes it. We combine the obtained specification instructions to create specification-augmented prompts, which we feed to language models pre-trained on natural instruction data to generate suite predictions. A core aspect of our analysis is to measure the effect that including a set of specifications has on a held-out set of unseen, qualitatively different specifications. Our experiments across four tasks and models ranging from 80M to 175B parameters show that smaller models struggle to follow specification instructions. However, larger models (> 3B params.) can benefit from specifications and even generalize desirable behaviors across functionalities.
arXiv.org Artificial Intelligence
Nov-14-2023
- Country:
- Asia
- China > Hong Kong (0.04)
- India (0.04)
- Middle East > UAE
- Abu Dhabi Emirate > Abu Dhabi (0.04)
- Pakistan (0.04)
- Philippines (0.04)
- Vietnam > Long An Province (0.04)
- Europe
- Austria > Vienna (0.14)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Italy (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- North America
- Canada > Ontario
- Toronto (0.04)
- Dominican Republic (0.04)
- Mexico (0.04)
- United States
- Arizona (0.04)
- Colorado (0.04)
- Georgia
- Bartow County (0.04)
- Rockdale County (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Nevada (0.04)
- Texas (0.04)
- Washington > King County
- Seattle (0.14)
- Canada > Ontario
- South America
- Asia
- Genre:
- Research Report
- Experimental Study (0.46)
- New Finding (0.45)
- Research Report
- Industry:
- Education > Educational Setting
- K-12 Education (0.67)
- Government (0.92)
- Health & Medicine > Therapeutic Area
- Immunology (0.46)
- Infections and Infectious Diseases (0.46)
- Information Technology (0.67)
- Law Enforcement & Public Safety (0.67)
- Leisure & Entertainment (1.00)
- Media (1.00)
- Education > Educational Setting
- Technology: