AITopics | geoqa

geoqa

GeoFM: Enhancing Geometric Reasoning of MLLMs via Synthetic Data Generation through Formal Language

Zhang, Yuhao, Hu, Dingxin, Yu, Tinghao, Liu, Hao, Liu, Yiting

arXiv.org Artificial Intelligence, Nov-3-2025

Multi-modal Large Language Models (MLLMs) have gained significant attention in both academia and industry for their capabilities in handling multi-modal tasks. However, these models face challenges in mathematical geometric reasoning due to the scarcity of high-quality geometric data. To address this issue, synthetic geometric data has become an essential strategy. Current methods for generating synthetic geometric data involve rephrasing or expanding existing problems and utilizing predefined rules and templates to create geometric images and problems. However, these approaches often produce data that lacks diversity or is prone to noise. Additionally, the geometric images synthesized by existing methods tend to exhibit limited variation and deviate significantly from authentic geometric diagrams. To overcome these limitations, we propose GeoFM, a novel method for synthesizing geometric data. GeoFM uses formal languages to explore combinations of conditions within metric space, generating high-fidelity geometric problems that differ from the originals while ensuring correctness through a symbolic engine. Experimental results show that our synthetic data significantly outperforms data produced by existing methods. The model trained with our data surpasses the proprietary GPT-4o model by 18.7% on geometry problem-solving tasks in MathVista and by 16.5% on GeoQA. Additionally, it exceeds the performance of a leading open-source model by 5.7% on MathVista and by 2.7% on GeoQA.

large language model, machine learning, natural language, (20 more...)

arXiv:2510.27448

Country: North America > Mexico (0.28)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

GeoQA: A Geometric Question Answering Benchmark Towards Multimodal Numerical Reasoning

Chen, Jiaqi, Tang, Jianheng, Qin, Jinghui, Liang, Xiaodan, Liu, Lingbo, Xing, Eric P., Lin, Liang

arXiv.org Artificial Intelligence, Jun-7-2021

Automatic math problem solving has recently attracted increasing attention as a long-standing AI benchmark. In this paper, we focus on solving geometric problems, which requires a comprehensive understanding of textual descriptions, visual diagrams, and theorem knowledge. However, existing methods depend heavily on handcrafted rules and have been evaluated only on small-scale datasets. Therefore, we propose a Geometric Question Answering dataset, GeoQA, containing 5,010 geometric problems with corresponding annotated programs that illustrate the solving process of the given problems. GeoQA is 25 times larger than GeoS, another publicly available dataset, and its program annotations provide a practical testbed for future research on explicit and explainable numerical reasoning. Moreover, we introduce a Neural Geometric Solver (NGS) to address geometric problems by comprehensively parsing multimodal information and generating interpretable programs. We further add multiple self-supervised auxiliary tasks on NGS to enhance cross-modal semantic representation. Extensive experiments on GeoQA validate the effectiveness of our proposed NGS and auxiliary tasks. However, the results are still significantly lower than human performance, which leaves substantial room for future research. Our benchmark and code are released at https://github.com/chen-judge/GeoQA.
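To make the idea of an annotated, interpretable program concrete, here is a minimal sketch of how a step-by-step solving program could be executed. The operator names, the `N_i` / `V_i` token convention, and the `run_program` helper are all hypothetical choices made for this illustration; they are not taken from the GeoQA repository, whose actual program format may differ.

```python
import math
from typing import Callable, Dict, List, Sequence, Tuple

# Hypothetical operator table for illustration only;
# the real GeoQA operator set may differ.
OPS: Dict[str, Callable[..., float]] = {
    "add": lambda a, b: a + b,
    "minus": lambda a, b: a - b,
    "mul": lambda a, b: a * b,
    "div": lambda a, b: a / b,
    "half": lambda a: a / 2,
    "sqrt": lambda a: math.sqrt(a),
}

# One step is an operator name plus its argument tokens.
Step = Tuple[str, Sequence[str]]


def run_program(steps: List[Step], numbers: Sequence[float]) -> float:
    """Run the steps in order and return the last result.

    Token convention (assumed for this sketch):
      "N_i" reads the i-th number given in the problem text,
      "V_i" reads the result of the i-th earlier step,
      anything else is parsed as a literal constant.
    """
    results: List[float] = []

    def resolve(token: str) -> float:
        if token.startswith("N_"):
            return numbers[int(token[2:])]
        if token.startswith("V_"):
            return results[int(token[2:])]
        return float(token)

    for op, args in steps:
        results.append(OPS[op](*(resolve(a) for a in args)))
    return results[-1]


# Toy problem: two angles of a triangle are 50 and 60 degrees;
# the third is 180 - (50 + 60).
program = [("add", ["N_0", "N_1"]), ("minus", ["180", "V_0"])]
answer = run_program(program, [50.0, 60.0])
print(answer)  # 70.0
```

Because every intermediate value is stored, each step of such a program can be inspected, which is the sense in which a program annotation exposes the solving process rather than only the final number.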

diagram, geometric problem, proceedings, (16 more...)

arXiv:2105.14517

Country: Asia > China > Guangdong Province > Shenzhen (0.04)

Genre: Research Report (0.64)

Industry: Education > Educational Setting (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.61)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

GeoQA: A Geometric Question Answering Benchmark Towards Multimodal Numerical Reasoning

#artificialintelligence, Jun-2-2021, 00:35:22 GMT

geometric problem, geoqa, multimodal numerical reasoning, (2 more...)

Technology: Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.65)

Geographic Question Answering: Challenges, Uniqueness, Classification, and Future Directions

Mai, Gengchen, Janowicz, Krzysztof, Zhu, Rui, Cai, Ling, Lao, Ni

arXiv.org Artificial Intelligence, May-19-2021

As an important part of Artificial Intelligence (AI), Question Answering (QA) aims at generating answers to questions phrased in natural language. While there has been substantial progress in open-domain question answering, QA systems still struggle to answer questions that involve geographic entities or concepts and require spatial operations. In this paper, we discuss the problem of geographic question answering (GeoQA). We first investigate why geographic questions are difficult to answer by analyzing the challenges they pose. We discuss the uniqueness of geographic questions compared to general QA. Then we review existing work on GeoQA and classify it by the types of questions it can address. Based on this survey, we provide a generic classification framework for geographic questions. Finally, we conclude our work by pointing out unique future research directions for GeoQA.

geographic entity, geographic question, relation, (16 more...)

arXiv:2105.09392

Country:

North America > United States > California > Santa Barbara County > Santa Barbara (0.14)
Europe > Russia (0.04)
Asia > Russia (0.04)
(36 more...)

Genre:

Overview (0.68)
Research Report (0.64)

Industry:

Education (0.93)
Consumer Products & Services (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

jacobandreas/nmn2

@machinelearnbot, Jun-16-2016, 11:41:05 GMT

This library provides code for training and evaluating neural module networks (NMNs). An NMN is a neural network that is assembled dynamically by composing shallow network fragments called modules into a deeper structure. These modules are jointly trained to be freely composable. At present the code supports predicting network layouts from natural-language strings, with end-to-end training of modules. Various extensions should be straightforward to implement, such as alternative layout predictors or supervised training of specific modules. Please cite the CVPR paper for the general NMN framework, and the NAACL paper for dynamic structure selection.
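The composition idea described above can be sketched in a few lines. This is a toy illustration only, not the nmn2 API: the module names (`find`, `and`, `count`), the nested-tuple layout format, and the `assemble` helper are assumptions made for this example, and the modules here are fixed functions over a hand-written scene rather than trained network fragments.

```python
from typing import Callable, Dict, List

# Toy stand-in for an image: a list of objects with attributes.
Scene = List[Dict[str, str]]


def find(scene: Scene, word: str) -> List[float]:
    """Attention over objects: 1.0 where any attribute equals `word`."""
    return [1.0 if word in obj.values() else 0.0 for obj in scene]


def and_(scene: Scene, a: List[float], b: List[float]) -> List[float]:
    """Combine two attentions by elementwise product."""
    return [x * y for x, y in zip(a, b)]


def count(scene: Scene, attention: List[float]) -> float:
    """Reduce an attention to a single number."""
    return sum(attention)


# Hypothetical module inventory; a real NMN would hold trainable modules.
MODULES: Dict[str, Callable] = {"find": find, "and": and_, "count": count}


def assemble(layout) -> Callable[[Scene], object]:
    """Build a callable network from a nested layout.

    A layout is either a string (a leaf argument such as a word) or a
    tuple (module_name, child_layout, ...). Each call builds a fresh
    structure, so different questions yield different networks.
    """
    if isinstance(layout, str):
        return lambda scene: layout
    name, *children = layout
    module = MODULES[name]
    child_nets = [assemble(child) for child in children]
    return lambda scene: module(scene, *(net(scene) for net in child_nets))


scene = [
    {"color": "red", "shape": "circle"},
    {"color": "red", "shape": "square"},
    {"color": "blue", "shape": "circle"},
]

# Layout for a question like "how many red circles?"
layout = ("count", ("and", ("find", "red"), ("find", "circle")))
network = assemble(layout)
result = network(scene)
print(result)  # 1.0
```

The point of the sketch is the structure: the same small set of modules is reused across layouts, and the layout, which the library predicts from the natural-language string, decides how they are wired together for each input.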

artificial intelligence, directory, natural language, (11 more...)

Technology: Information Technology > Artificial Intelligence > Natural Language (0.37)
