ICLEval: Evaluating In-Context Learning Ability of Large Language Models

Chen, Wentong, Lin, Yankai, Zhou, ZhenHao, Huang, HongYun, Jia, Yantao, Cao, Zhao, Wen, Ji-Rong

Jun-21-2024–arXiv.org Artificial Intelligence

In-Context Learning (ICL) is a critical capability of Large Language Models (LLMs) as it empowers them to comprehend and reason across interconnected inputs. Evaluating the ICL ability of LLMs can enhance their utilization and deepen our understanding of how this ability is acquired at the training stage. However, existing evaluation frameworks primarily focus on language abilities and knowledge, often overlooking the assessment of ICL ability. In this work, we introduce the ICLEval benchmark to evaluate the ICL abilities of LLMs, which encompasses two key sub-abilities: exact copying and rule learning. Through the ICLEval benchmark, we demonstrate that ICL ability is universally present in different LLMs, and model size is not the sole determinant of ICL efficacy. Surprisingly, we observe that ICL abilities, particularly copying, develop early in the pretraining process and stabilize afterward. Our source codes and benchmark are released at https://github.com/yiye3/ICLEval.

arxiv preprint arxiv, n-shot example, preprint arxiv, (13 more...)

arXiv.org Artificial Intelligence

Jun-21-2024

arXiv.org PDF

Add feedback

Country:
- Oceania > Australia
  - Queensland (0.04)
- North America > United States
  - Oregon (0.04)
  - New York (0.04)
  - Indiana > Marion County
    - Indianapolis (0.04)
  - Nevada > Clark County
    - Las Vegas (0.04)
  - Louisiana > Orleans Parish
    - New Orleans (0.04)
  - Utah > Salt Lake County
    - Salt Lake City (0.04)
  - Illinois > Cook County
    - Chicago (0.04)
  - Missouri > Jackson County
    - Kansas City (0.04)
  - Georgia > Fulton County
    - Atlanta (0.04)
  - California
    - San Francisco County > San Francisco (0.04)
    - Los Angeles County > Los Angeles (0.04)
- Asia
  - Singapore (0.04)
  - Myanmar > Yangon Region
    - Yangon (0.04)
  - India
    - Uttar Pradesh > Lucknow (0.04)
    - Maharashtra > Mumbai (0.04)
  - China
    - Gansu Province (0.04)
    - Beijing > Beijing (0.04)

Genre:
- Research Report > New Finding (0.46)

Industry:
- Education (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found