Measuring Mathematical Problem Solving With the MATH Dataset