LEAN-GitHub: Compiling GitHub LEAN repositories for a versatile LEAN prover
Wu, Zijian; Wang, Jiayu; Lin, Dahua; Chen, Kai
Recently, large language models have presented promising results in aiding formal mathematical reasoning. However, their performance is restricted due to the scarcity of formal theorem-proving data, which requires additional effort to be extracted from raw formal language corpora. Meanwhile, a significant amount of human-written formal language corpora remains underutilized. To address this issue, we propose LEAN-GitHub, a dataset consisting of large-scale formal data extracted from almost all Lean 4 repositories on GitHub. After fine-tuning InternLM-math-plus on this dataset, our model achieved accuracies of 48.8% with a single pass and 54.5% with 64 passes on the Lean 4 miniF2F test, surpassing the state-of-the-art method at 52%. It also achieves state of the art on two other Lean 4 benchmarks (ProofNet and Putnam) targeting different fields and levels of mathematics. These results demonstrate that our proposed dataset is beneficial for formal reasoning on a wide range of math topics. We open-source our model at https://github.com/InternLM/InternLM-Math and our data at https://huggingface.co/datasets/InternLM/Lean-GitHub
Lean Workbook: A large-scale Lean problem set formalized from natural language math problems
Ying, Huaiyuan; Wu, Zijian; Geng, Yihan; Wang, Jiayu; Lin, Dahua; Chen, Kai
Large language models have demonstrated impressive capabilities across various natural language processing tasks, especially in solving mathematical problems. However, large language models are not good at math theorem proving using formal languages like Lean. A significant challenge in this area is the scarcity of training data available in these formal languages. To address this issue, we propose a novel pipeline that iteratively generates and filters synthetic data to translate natural language mathematical problems into Lean 4 statements, and vice versa. Our results indicate that the synthetic data pipeline can provide useful training data and improve the performance of LLMs in translating and understanding complex mathematical problems and proofs. Our final dataset contains about 57K formal-informal question pairs along with searched proofs from the math contest forum and 21 new IMO questions.
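To illustrate the kind of formal-informal pair such an autoformalization pipeline produces, here is a minimal sketch in Lean 4 with Mathlib. The specific statement, theorem name, and proof tactic are illustrative assumptions, not items drawn from the Lean Workbook dataset:

```lean
import Mathlib

-- Informal: "Prove that for all real numbers a and b, a² + b² ≥ 2ab."
-- Formal (hypothetical example of a translated statement):
theorem sum_sq_ge_two_mul (a b : ℝ) : a ^ 2 + b ^ 2 ≥ 2 * a * b := by
  -- follows from (a - b)² ≥ 0 after expansion
  nlinarith [sq_nonneg (a - b)]
```

A pipeline of this kind would pair the informal comment with the formal `theorem` statement, and a proof search component would then attempt to close the goal with tactics such as the one shown.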