Exploring the Mystery of Influential Data for Mathematical Reasoning

Ni, Xinzhe, Gong, Yeyun, Gou, Zhibin, Shen, Yelong, Yang, Yujiu, Duan, Nan, Chen, Weizhu

Apr-1-2024–arXiv.org Artificial Intelligence

Selecting influential data for fine-tuning on downstream tasks is a key factor for both performance and computation efficiency. Recent works have shown that training with only limited data can show a superior performance on general tasks. However, the feasibility on mathematical reasoning tasks has not been validated. To go further, there exist two open questions for mathematical reasoning: how to select influential data and what is an influential data composition. For the former one, we propose a Quality-aware Diverse Selection (QaDS) strategy adaptable for mathematical reasoning. A comparison with other selection strategies validates the superiority of QaDS. For the latter one, we first enlarge our setting and explore the influential data composition. We conduct a series of experiments and highlight: scaling up reasoning data, and training with general data selected by QaDS is helpful. Then, we define our optimal mixture as OpenMathMix, an influential data mixture with open-source data selected by QaDS. With OpenMathMix, we achieve a state-of-the-art 48.8% accuracy on MATH with 7B base model. Additionally, we showcase the use of QaDS in creating efficient fine-tuning mixtures with various selection ratios, and analyze the quality of a wide range of open-source datasets, which can perform as a reference for future works on mathematical reasoning tasks.

arxiv preprint arxiv, dataset, mathematical reasoning task, (10 more...)

arXiv.org Artificial Intelligence

Apr-1-2024

arXiv.org PDF

Add feedback

Country:
- Asia
  - Nepal (0.04)
  - Myanmar > Tanintharyi Region
    - Dawei (0.04)
  - China > Shanghai
    - Shanghai (0.04)

Genre:
- Research Report (0.82)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Large Language Model (1.00)
    - Chatbot (0.71)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found