Revisiting Zero-Shot Abstractive Summarization in the Era of Large Language Models from the Perspective of Position Bias

Chhabra, Anshuman, Askari, Hadi, Mohapatra, Prasant

Jan-3-2024–arXiv.org Artificial Intelligence

We characterize and study zero-shot abstractive summarization in Large Language Models (LLMs) by measuring position bias, which we propose as a general formulation of the more restrictive lead bias phenomenon studied previously in the literature. Position bias captures the tendency of a model unfairly prioritizing information from certain parts of the input text over others, leading to undesirable behavior. Through numerous experiments on four diverse real-world datasets, we study position bias in multiple LLM models such as GPT 3.5-Turbo, Llama-2, and Dolly-v2, as well as state-of-the-art pretrained encoder-decoder abstractive summarization models such as Pegasus and BART. Our findings lead to novel insights and discussion on performance and position bias of models for zero-shot summarization tasks.

dataset, position bias, summarization, (14 more...)

arXiv.org Artificial Intelligence

Jan-3-2024

arXiv.org PDF

Add feedback

Country:
- North America > United States > California > Yolo County > Davis (0.04)

Genre:
- Research Report > New Finding (0.48)

Industry:
- Information Technology (0.68)
- Media > News (0.49)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)