Symbol tuning improves in-context learning in language models
Wei, Jerry, Hou, Le, Lampinen, Andrew, Chen, Xiangning, Huang, Da, Tay, Yi, Chen, Xinyun, Lu, Yifeng, Zhou, Denny, Ma, Tengyu, Le, Quoc V.
–arXiv.org Artificial Intelligence
We present symbol tuning - finetuning language models on in-context input-label pairs where natural language labels (e.g., "positive/negative sentiment") are replaced with arbitrary symbols (e.g., "foo/bar"). Symbol tuning leverages the intuition that when a model cannot use instructions or natural language labels to figure out a task, it must instead do so by learning the input-label mappings. We experiment with symbol tuning across Flan-PaLM models up to 540B parameters and observe benefits across various settings. First, symbol tuning boosts performance on unseen in-context learning tasks and is much more robust to underspecified prompts, such as those without instructions or without natural language labels. Second, symbol-tuned models are much stronger at algorithmic reasoning tasks, with up to 18.2% better performance on the List Functions benchmark and up to 15.3% better performance on the Simple Turing Concepts benchmark. Finally, symbol-tuned models show large improvements in following flipped-labels presented in-context, meaning that they are more capable of using in-context information to override prior semantic knowledge.
arXiv.org Artificial Intelligence
Dec-30-2023
- Country:
- Africa
- Côte d'Ivoire > Abidjan
- Abidjan (0.04)
- Kenya (0.04)
- Nigeria (0.04)
- South Africa (0.04)
- Côte d'Ivoire > Abidjan
- Asia
- Pakistan (0.04)
- Japan > Honshū
- Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
- South Korea > Daejeon
- Daejeon (0.04)
- Kazakhstan (0.04)
- India > Karnataka
- Bengaluru (0.04)
- Middle East
- Iran (0.04)
- Iraq > Baghdad Governorate
- Baghdad (0.04)
- Republic of Türkiye (0.04)
- Russia (0.14)
- China
- Taiwan (0.04)
- Afghanistan (0.04)
- Europe
- Italy > Emilia-Romagna
- Metropolitan City of Bologna > Bologna (0.04)
- United Kingdom (0.27)
- Eastern Europe (0.04)
- Belgium > Flanders
- Flemish Brabant > Leuven (0.04)
- Middle East > Cyprus (0.04)
- Russia (0.14)
- France
- Pays de la Loire > Loire-Atlantique
- Nantes (0.04)
- Île-de-France > Paris
- Paris (0.04)
- Pays de la Loire > Loire-Atlantique
- Germany (0.04)
- Switzerland > Zürich
- Zürich (0.14)
- Spain > Galicia
- Madrid (0.04)
- Italy > Emilia-Romagna
- North America
- Canada
- Manitoba > Winnipeg Metropolitan Region
- Winnipeg (0.04)
- Ontario > Toronto (0.04)
- Manitoba > Winnipeg Metropolitan Region
- Cuba > La Habana Province
- Havana (0.04)
- United States
- Wisconsin > Milwaukee County
- Milwaukee (0.04)
- New York > New York County
- New York City (0.04)
- California
- Los Angeles County > Los Angeles (0.04)
- San Francisco County > San Francisco (0.14)
- New Jersey > Monmouth County
- Asbury Park (0.04)
- Kansas > Wyandotte County
- Kansas City (0.04)
- Washington > King County
- Seattle (0.04)
- Illinois
- Champaign County > Urbana (0.04)
- Cook County > Chicago (0.04)
- Alaska > Denali Borough
- Mt Mckinley (0.04)
- Oklahoma (0.04)
- Missouri > Jackson County
- Kansas City (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Arizona (0.04)
- Texas
- Dallas County > Dallas (0.04)
- Travis County > Austin (0.04)
- Utah (0.04)
- Florida (0.04)
- Wisconsin > Milwaukee County
- Canada
- Oceania
- Australia (0.04)
- Marshall Islands (0.04)
- Micronesia (0.04)
- New Zealand (0.04)
- Africa
- Genre:
- Research Report > New Finding (0.67)
- Industry:
- Leisure & Entertainment > Sports
- Baseball (0.92)
- Media
- Banking & Finance > Trading (0.67)
- Automobiles & Trucks (0.92)
- Education (0.68)
- Materials (0.92)
- Semiconductors & Electronics (1.00)
- Government
- Immigration & Customs (1.00)
- Military (1.00)
- Regional Government
- Europe Government (0.67)
- North America Government > United States Government (1.00)
- Health & Medicine
- Consumer Health (0.92)
- Pharmaceuticals & Biotechnology (1.00)
- Therapeutic Area
- Immunology (1.00)
- Infections and Infectious Diseases (0.67)
- Energy (0.67)
- Information Technology (1.00)
- Consumer Products & Services (1.00)
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
- Law > Statutes (0.67)
- Leisure & Entertainment > Sports
- Technology: