Boosted Prompt Ensembles for Large Language Models

Pitis, Silviu, Zhang, Michael R., Wang, Andrew, Ba, Jimmy

Apr-12-2023–arXiv.org Artificial Intelligence

Methods such as chain-of-thought prompting and self-consistency have pushed the frontier of language model reasoning performance with no additional training. To further improve performance, we propose a prompt ensembling method for large language models, which uses a small dataset to construct a set of few shot prompts that together comprise a ``boosted prompt ensemble''. The few shot examples for each prompt are chosen in a stepwise fashion to be ``hard'' examples on which the previous step's ensemble is uncertain. We show that this outperforms single-prompt output-space ensembles and bagged prompt-space ensembles on the GSM8k and AQuA datasets, among others. We propose both train-time and test-time versions of boosted prompting that use different levels of available annotation and conduct a detailed empirical study of our algorithm.

large language model, machine learning, natural language, (15 more...)

arXiv.org Artificial Intelligence

Apr-12-2023

arXiv.org PDF

Add feedback

Country:
- North America
  - United States (0.04)
  - Canada > Ontario
    - Toronto (0.14)

Genre:
- Research Report (0.64)

Industry:
- Education (0.68)
- Leisure & Entertainment > Sports
  - Basketball (0.47)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.68)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found