GistScore: Learning Better Representations for In-Context Example Selection with Gist Bottlenecks
Gupta, Shivanshu, Rosenbaum, Clemens, Elenberg, Ethan R.
–arXiv.org Artificial Intelligence
Large language models (LLMs) have the ability to perform in-context learning (ICL) of new tasks by conditioning on prompts comprising a few task examples. This work studies the problem of selecting the best examples given a candidate pool to improve ICL performance on given a test input. Existing approaches either require training with feedback from a much larger LLM or are computationally expensive. We propose a novel metric, GistScore, based on Example Gisting, a novel approach for training example retrievers for ICL using an attention bottleneck via Gisting, a recent technique for compressing task instructions. To tradeoff performance with ease of use, we experiment with both fine-tuning gist models on each dataset and multi-task training a single model on a large collection of datasets. On 21 diverse datasets spanning 9 tasks, we show that our fine-tuned models get state-of-the-art ICL performance with 20% absolute average gain over off-the-shelf retrievers and 7% over the best prior methods. Our multi-task model generalizes well out-of-the-box to new task categories, datasets, and prompt templates with retrieval speeds that are consistently thousands of times faster than the best prior training-free method.
arXiv.org Artificial Intelligence
Nov-16-2023
- Country:
- Africa > Ethiopia
- Addis Ababa > Addis Ababa (0.04)
- Asia
- China > Hong Kong (0.04)
- Middle East
- Jordan (0.04)
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.04)
- Europe
- Austria (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Italy > Tuscany
- Florence (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Sweden > Stockholm
- Stockholm (0.04)
- United Kingdom > England
- Cumbria (0.04)
- North America
- Canada > Quebec
- Montreal (0.04)
- United States
- California > Orange County
- Irvine (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Maryland > Montgomery County
- Gaithersburg (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- New York (0.04)
- Washington > King County
- Seattle (0.14)
- California > Orange County
- Canada > Quebec
- Oceania > Australia (0.05)
- South America
- Brazil (0.04)
- Chile > Santiago Metropolitan Region
- Santiago Province > Santiago (0.04)
- Africa > Ethiopia
- Genre:
- Research Report (1.00)
- Industry:
- Energy > Renewable
- Leisure & Entertainment > Sports
- Football (1.00)
- Materials > Chemicals
- Commodity Chemicals (0.70)
- Technology: