Goto

Collaborating Authors

 mathnet


Can you best a math Olympiad? Test your skills with the world's largest database of problems.

Popular Science

MathNet contains 30,000 free math problems collected over half a century. More information Adding us as a Preferred Source in Google by using this link indicates that you would like to see more of our content in Google News results. The International Mathematical Olympiad was first held in Romania in 1959. Breakthroughs, discoveries, and DIY tips sent six days a week. In 1959, countries around the world sent their most talented students to Romania to compete in the first-ever International Mathematical Olympiad (IMO).


MathNet: A Data-Centric Approach for Printed Mathematical Expression Recognition

Schmitt-Koopmann, Felix M., Huang, Elaine M., Hutter, Hans-Peter, Stadelmann, Thilo, Darvishy, Alireza

arXiv.org Artificial Intelligence

Printed mathematical expression recognition (MER) models are usually trained and tested using LaTeX-generated mathematical expressions (MEs) as input and the LaTeX source code as ground truth. As the same ME can be generated by various different LaTeX source codes, this leads to unwanted variations in the ground truth data that bias test performance results and hinder efficient learning. In addition, the use of only one font to generate the MEs heavily limits the generalization of the reported results to realistic scenarios. We propose a data-centric approach to overcome this problem, and present convincing experimental results: Our main contribution is an enhanced LaTeX normalization to map any LaTeX ME to a canonical form. Based on this process, we developed an improved version of the benchmark dataset im2latex-100k, featuring 30 fonts instead of one. Second, we introduce the real-world dataset realFormula, with MEs extracted from papers. Third, we developed a MER model, MathNet, based on a convolutional vision transformer, with superior results on all four test sets (im2latex-100k, im2latexv2, realFormula, and InftyMDB-1), outperforming the previous state of the art by up to 88.3%.