Self-Consistency Boosts Calibration for Math Reasoning

Wang, Ante, Song, Linfeng, Tian, Ye, Peng, Baolin, Jin, Lifeng, Mi, Haitao, Su, Jinsong, Yu, Dong

Mar-14-2024–arXiv.org Artificial Intelligence

Calibration, which establishes the correlation between accuracy and model confidence, is important for LLM development. We design three off-the-shelf calibration methods based on self-consistency (Wang et al., 2022) for math reasoning tasks. Evaluated on two popular benchmarks (GSM8K and MathQA) with strong open-source LLMs (Mistral and LLaMA2), our methods bridge model confidence and accuracy better than existing methods based on p(True) (Kadavath et al., 2022) or logit (Kadavath et al., 2022).
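As a rough illustration of the idea, and not the paper's three methods (which this abstract does not spell out), a self-consistency confidence can be sketched as the share of sampled answers that agree with the majority answer. The function names below are hypothetical, and the expected calibration error (ECE) helper is a commonly used calibration metric included here as an assumption, not something the abstract confirms.

```python
from collections import Counter


def self_consistency_confidence(answers):
    """Return (majority_answer, confidence) for a list of sampled final answers.

    Confidence is the fraction of samples that agree with the majority answer.
    This is an illustrative assumption, not the paper's exact estimator.
    """
    if not answers:
        raise ValueError("need at least one sampled answer")
    counts = Counter(answers)
    majority, votes = counts.most_common(1)[0]
    return majority, votes / len(answers)


def expected_calibration_error(confidences, correct, n_bins=10):
    """Expected calibration error with equal-width confidence bins.

    confidences: floats in [0, 1]; correct: booleans of the same length.
    """
    n = len(confidences)
    if n == 0 or n != len(correct):
        raise ValueError("confidences and correct must be non-empty and equal length")
    bins = [[] for _ in range(n_bins)]
    for conf, ok in zip(confidences, correct):
        # A confidence of exactly 1.0 goes into the last bin.
        idx = min(int(conf * n_bins), n_bins - 1)
        bins[idx].append((conf, ok))
    ece = 0.0
    for bucket in bins:
        if not bucket:
            continue
        avg_conf = sum(c for c, _ in bucket) / len(bucket)
        accuracy = sum(1 for _, ok in bucket if ok) / len(bucket)
        # Weight each bin's confidence/accuracy gap by its share of samples.
        ece += len(bucket) / n * abs(avg_conf - accuracy)
    return ece


# Four sampled answers to one question: three agree on "18".
answer, confidence = self_consistency_confidence(["18", "18", "20", "18"])
```

A lower ECE means the confidence scores track accuracy more closely, which is the sense in which the abstract speaks of bridging confidence and accuracy.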

calibration, computational linguistics, word problem

Country:
- North America > United States
  - Washington > King County > Bellevue (0.04)
- Asia
  - Taiwan (0.04)
  - China > Fujian Province
    - Xiamen (0.05)

Genre:
- Research Report (0.40)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.36)
