OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset

Neural Information Processing Systems 

Recent work has shown the immense potential of synthetically generated datasets for training large language models (LLMs), especially for acquiring targeted skills. Current large-scale math instruction tuning datasets such as MetaMathQA [1] and MAmmoTH [2] are constructed using outputs from closed-source LLMs with commercially restrictive licenses. A key reason limiting the use of open-source LLMs in these data generation pipelines has been the wide gap between the mathematical skills of the best closed-source LLMs, such as GPT-4, and the best open-source LLMs. Building on our novel prompting strategy, the recent progress in open-source LLMs, and some brute-force scaling, we construct OpenMathInstruct-1, a high-quality math instruction tuning dataset with 1.8M problem-solution pairs. The dataset is constructed by synthesizing code-interpreter solutions for GSM8K and MATH, two popular math reasoning benchmarks, using the recently released and permissively licensed Mixtral model. Our best model, OpenMath-CodeLlama-70B, trained on a subset of OpenMathInstruct-1, achieves a score of 84.6% on GSM8K and 50.7% on MATH, which is competitive with the best GPT-distilled models. To support open-source efforts, we have released our code, models, and the OpenMathInstruct-1 dataset under a commercially permissive license.
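To make the construction process concrete, the sketch below illustrates the general idea the abstract describes: an open-source LLM (e.g., Mixtral) is prompted with few-shot examples to produce code-interpreter style solutions, and only candidate solutions whose final answer matches the benchmark's ground truth are retained. This is a minimal illustration under assumed names; the helper functions (extract_final_answer, filter_correct) and the "Answer:" marker are placeholders, not the released pipeline's actual API.

```python
# Illustrative sketch of answer-matching filtering for synthetic solutions.
# Assumes each problem dict has "question" and "answer" fields, and each
# candidate solution ends with a line of the form "Answer: <value>".

def extract_final_answer(solution: str) -> str:
    """Return the text after the last 'Answer:' marker, or '' if absent."""
    marker = "Answer:"
    idx = solution.rfind(marker)
    return solution[idx + len(marker):].strip() if idx != -1 else ""

def filter_correct(problems, candidate_solutions):
    """Keep (question, solution) pairs whose extracted answer matches ground truth."""
    kept = []
    for prob, solutions in zip(problems, candidate_solutions):
        for sol in solutions:
            if extract_final_answer(sol) == prob["answer"].strip():
                kept.append({"question": prob["question"], "solution": sol})
    return kept
```

In practice, many candidate solutions are sampled per problem ("brute-force scaling") and the surviving pairs form the instruction tuning set; deduplication and further quality filtering would follow this step.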
