Measuring Multimodal Mathematical Reasoning with the MA TH-Vision Dataset

Neural Information Processing Systems 

However, we observe significant limitations in the diversity of questions and breadth of subjects covered by these benchmarks.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found