In-Context Exemplars as Clues to Retrieving from Large Associative Memory

Zhao, Jiachen

arXiv.org Artificial Intelligence 

In recent years, large language models (LLMs) have garnered significant attention for their impressive language understanding and reasoning capabilities, which are reshaping natural language processing (NLP) (7; 6; 45; 56; 44). LLMs are first pretrained on extensive data with a language-modeling objective, in which the model predicts the next token given a context. Without finetuning on task-specific data, LLMs can leverage in-context learning (ICL), also referred to as few-shot prompting, to make predictions. Through ICL, an LLM infers the underlying pattern of a task from given in-context exemplars, such as a set of input/output pairs, and applies that pattern to complete the response to the input query. However, the effects of in-context exemplars on downstream ICL performance, and guidelines for formulating those exemplars (e.g., how to select exemplars and how many to use), remain unclear.
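To make the ICL setup concrete, the following is a minimal sketch of how a few-shot prompt is typically assembled from input/output exemplars; the task, exemplars, and `Input:`/`Output:` template are illustrative assumptions, not specifics from this paper.

```python
# Minimal sketch of in-context learning (ICL) prompt construction.
# The sentiment task and the Input:/Output: template are illustrative
# assumptions; actual prompt formats vary across papers and models.

def build_icl_prompt(exemplars, query):
    """Format input/output exemplar pairs followed by the query,
    leaving the final output slot empty for the LLM to complete."""
    blocks = [f"Input: {x}\nOutput: {y}" for x, y in exemplars]
    blocks.append(f"Input: {query}\nOutput:")
    return "\n\n".join(blocks)

exemplars = [
    ("The movie was fantastic.", "positive"),
    ("I regret buying this.", "negative"),
]
prompt = build_icl_prompt(exemplars, "A delightful surprise.")
print(prompt)
```

The model conditions on this whole string; no parameters are updated, and the prediction is simply the continuation after the final `Output:`.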