Active Example Selection for In-Context Learning
Zhang, Yiming, Feng, Shi, Tan, Chenhao
–arXiv.org Artificial Intelligence
With a handful of demonstration examples, large-scale language models show strong capability to perform various tasks by in-context learning from these examples, without any fine-tuning. We demonstrate that in-context learning performance can be highly unstable across samples of examples, indicating the idiosyncrasies of how language models acquire information. We formulate example selection for in-context learning as a sequential decision problem, and propose a reinforcement learning algorithm for identifying generalizable policies to select demonstration examples. For GPT-2, our learned policies demonstrate strong abilities of generalizing to unseen tasks in training, with a $5.8\%$ improvement on average. Examples selected from our learned policies can even achieve a small improvement on GPT-3 Ada. However, the improvement diminishes on larger GPT-3 models, suggesting emerging capabilities of large language models.
arXiv.org Artificial Intelligence
Nov-8-2022
- Country:
- Oceania > Australia
- North America > United States
- Washington > King County
- Seattle (0.04)
- New York > New York County
- New York City (0.04)
- New Jersey > Mercer County
- Princeton (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Illinois > Cook County
- Chicago (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- California > San Francisco County
- San Francisco (0.14)
- Washington > King County
- Europe
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Ireland > Leinster
- Asia
- Middle East > Jordan (0.04)
- Japan > Honshū
- Chūbu > Toyama Prefecture > Toyama (0.04)
- Genre:
- Research Report > New Finding (0.68)
- Technology: