Learning to Retrieve In-Context Examples for Large Language Models

Jul-14-2023–arXiv.org Artificial Intelligence

Large language models (LLMs) have demonstrated their ability to learn in-context, allowing them to perform various tasks based on a few input-output examples. However, the effectiveness of in-context learning is heavily reliant on the quality of the selected examples. In this paper, we propose a novel framework to iteratively train dense retrievers that can identify high-quality in-context examples for LLMs. Our framework initially trains a reward model based on LLM feedback to evaluate the quality of candidate examples, followed by knowledge distillation to train a bi-encoder based dense retriever. Our experiments on a suite of 30 tasks demonstrate that our framework significantly enhances in-context learning performance. Furthermore, we show the generalization ability of our framework to unseen tasks during training. An in-depth analysis reveals that our model improves performance by retrieving examples with similar patterns, and the gains are consistent across LLMs of varying sizes.

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

Jul-14-2023

arXiv.org PDF

Add feedback

Country:
- North America > Dominican Republic (0.04)
- Europe
  - Austria (0.04)
  - Portugal > Lisbon
    - Lisbon (0.04)
- Asia > China
  - Hong Kong (0.04)

Genre:
- Research Report (0.82)

Industry:
- Information Technology (0.68)
- Leisure & Entertainment > Sports (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.47)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found