ScatterShot: Interactive In-context Example Curation for Text Transformation
Wu, Tongshuang, Shen, Hua, Weld, Daniel S., Heer, Jeffrey, Ribeiro, Marco Tulio
arXiv.org Artificial Intelligence
The in-context learning capabilities of LLMs like GPT-3 allow annotators to customize an LLM to their specific tasks with a small number of examples. However, users tend to include only the most obvious patterns when crafting examples, resulting in underspecified in-context functions that fall short on unseen cases. Further, it is hard to know when "enough" examples have been included even for known patterns. In this work, we present ScatterShot, an interactive system for building high-quality demonstration sets for in-context learning. ScatterShot iteratively slices unlabeled data into task-specific patterns, samples informative inputs from underexplored or not-yet-saturated slices in an active learning manner, and helps users label more efficiently with the help of an LLM and the current example set. In simulation studies on two text perturbation scenarios, ScatterShot sampling improves the resulting few-shot functions by 4-5 percentage points over random sampling, with less variance as more examples are added. In a user study, ScatterShot greatly helps users in covering different patterns in the input space and labeling in-context examples more efficiently, resulting in better in-context learning and less user effort.
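The sampling idea in the abstract (prefer slices that are underexplored or not yet saturated) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `pick_slice` and `sample_next`, the smoothed error-rate estimate, and the `explore_bonus` weight are all hypothetical choices made for illustration, and the slices are assumed to have been computed already.

```python
def pick_slice(slices, labeled_counts, error_counts, explore_bonus=1.0):
    """Return the key of the slice to sample from next, or None if all are empty.

    Hypothetical scoring rule (an assumption, not taken from the paper):
      score = smoothed error rate + explore_bonus / (1 + number of labeled examples)
    A slice scores high when the current few-shot function still makes errors
    on it (not yet saturated) or when few of its inputs are labeled (underexplored).
    """
    best_key, best_score = None, float("-inf")
    for key in sorted(slices):  # sorted for deterministic tie-breaking
        if not slices[key]:
            continue  # nothing left to sample in this slice
        n_labeled = labeled_counts.get(key, 0)
        n_errors = error_counts.get(key, 0)
        # Add-one smoothing so unlabeled slices start at an error rate of 0.5.
        error_rate = (n_errors + 1) / (n_labeled + 2)
        score = error_rate + explore_bonus / (1 + n_labeled)
        if score > best_score:
            best_key, best_score = key, score
    return best_key


def sample_next(slices, labeled_counts, error_counts):
    """Remove and return (slice_key, input_text) from the highest-priority slice."""
    key = pick_slice(slices, labeled_counts, error_counts)
    if key is None:
        return None
    return key, slices[key].pop(0)
```

In an interactive loop, the returned input would be shown to the user with the LLM's proposed output; the user's accept or correct decision would then update `labeled_counts` and `error_counts` for that slice before the next call.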
Feb-14-2023
- Country:
- Oceania > Australia
- New South Wales > Sydney (0.05)
- North America
- Dominican Republic (0.04)
- United States
- Texas > Travis County
- Austin (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Hawaii > Honolulu County
- Honolulu (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- California > Los Angeles County
- Long Beach (0.14)
- Wisconsin > Dane County
- Madison (0.04)
- Washington > King County
- Seattle (0.04)
- New York > New York County
- New York City (0.04)
- Pennsylvania > Allegheny County
- Pittsburgh (0.04)
- Canada
- Europe
- Germany > Berlin (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Middle East > Republic of Türkiye
- Istanbul Province > Istanbul (0.04)
- Italy
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Asia
- China > Hong Kong (0.04)
- Singapore (0.04)
- Middle East > Republic of Türkiye
- Istanbul Province > Istanbul (0.04)
- Genre:
- Questionnaire & Opinion Survey (0.86)
- Research Report
- New Finding (0.46)
- Experimental Study (0.46)
- Industry:
- Health & Medicine (0.68)
- Education (0.46)
- Technology: