AITopics | isabelle

Collaborating Authors

isabelle

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

FVEL: Interactive Formal Verification Environment with Large Language Models via Theorem Proving

Neural Information Processing SystemsApr-28-2026, 16:05:26 GMT

Formal verification (FV) has witnessed growing significance with current emerging program synthesis by the evolving large language models (LLMs). However, current formal verification mainly resorts to symbolic verifiers or hand-craft rules, resulting in limitations for extensive and flexible verification. On the other hand, formal languages for automated theorem proving, such as Isabelle, as another line of rigorous verification, are maintained with comprehensive rules and theorems. In this paper, we propose FVEL, an interactive Formal Verification Environment with LLMs. Specifically, FVEL transforms a given code to be verified into Isabelle, and then conducts verification via neural automated theorem proving with an LLM. The joined paradigm leverages the rigorous yet abundant formulated and organized rules in Isabelle and is also convenient for introducing and adjusting cutting-edge LLMs. To achieve this goal, we extract a large-scale FVELER. The FVELER dataset includes code dependencies and verification processes that are formulated in Isabelle, containing 758 theories, 29,304 lemmas, and 201,498 proof steps in total with in-depth dependencies.

large language model, logic & formal reasoning, natural language, (13 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

Multi-language Diversity Benefits Autoformalization

Neural Information Processing SystemsFeb-16-2026, 20:30:32 GMT

Autoformalization is the task of translating natural language materials into machine-verifiable formalisations. Progress in autoformalization research is hindered by the lack of a sizeable dataset consisting of informal-formal pairs expressing the same essence.

large language model, logic & formal reasoning, machine learning, (21 more...)

Neural Information Processing Systems

Country:

Europe > Austria > Vienna (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
(7 more...)

Genre: Research Report > Experimental Study (1.00)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.95)
Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (0.95)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)

Add feedback

62c6d7893b13a13c659cb815852dd00d-Supplemental-Datasets_and_Benchmarks_Track.pdf

Neural Information Processing SystemsFeb-15-2026, 10:41:27 GMT

large language model, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
Asia > China > Hong Kong (0.04)
Europe > Italy > Lazio > Rome (0.04)
(2 more...)

Industry: Law (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.99)

Add feedback

62c6d7893b13a13c659cb815852dd00d-Paper-Datasets_and_Benchmarks_Track.pdf

Neural Information Processing SystemsFeb-15-2026, 10:41:24 GMT

large language model, machine learning, programming language, (25 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
Asia > China > Hong Kong (0.04)
(13 more...)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Software > Programming Languages (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(2 more...)

Add feedback

d0c6bc641a56bebee9d985b937307367-Paper-Conference.pdf

Neural Information Processing SystemsFeb-12-2026, 01:32:26 GMT

Asuccessful autoformalization system could advance the fields of formal verification, program synthesis, and artificial intelligence. While the long-term goal of autoformalization seemed elusive for a long time, we show large language models provide new prospects towards this goal.

logic & formal reasoning, machine learning, urlhttp, (18 more...)

Neural Information Processing Systems

Country:

Europe > Austria > Vienna (0.14)
Europe > Germany > Berlin (0.04)
North America > United States > Montana (0.04)
(9 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (0.89)

Add feedback

onthePutnamMathematicalCompetition

Neural Information Processing SystemsFeb-8-2026, 09:37:16 GMT

Automating mathematical reasoning is a longstanding goal in artificial intelligence (Newell et al., 1957).

large language model, machine learning, natural language, (21 more...)

Neural Information Processing Systems

Country:

North America > United States (0.14)
North America > Canada > Quebec > Montreal (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.70)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

Add feedback

Thor: WieldingHammerstoIntegrateLanguage ModelsandAutomatedTheoremProvers

Neural Information Processing SystemsFeb-8-2026, 08:04:51 GMT

In theorem proving, the task of selecting useful premises from alarge library to unlock the proof of a given conjecture is crucially important. This presents a challenge foralltheorem provers,especially theonesbasedonlanguage models, due to their relative inability to reason over huge volumes of premises in text form.

logic & formal reasoning, machine learning, urlhttp, (20 more...)

Neural Information Processing Systems

Country:

Europe > Germany > Bremen > Bremen (0.04)
Oceania > Australia > New South Wales > Sydney (0.04)
Europe > United Kingdom > Scotland > City of Edinburgh > Edinburgh (0.04)
(3 more...)

Genre: Research Report (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (0.48)

Add feedback

The Brilliant New Movie About Alexander Skarsg em å /em rd Making Dudley Dursley His Toy

SlateFeb-6-2026, 15:30:02 GMT

Fans of will be happy to hear that there's been another entry into the world of scintillating gay romance. The film stars noted on-screen sex haver Alexander Skarsgård--he's equally provocative in the NC-17-rated --and some guy named Harry Melling, who seems to have been in . Melling plays Colin, a certified beta whose deepest desire is to serve. He gets his wish when he meets Ray (Skarsgård), a toppy, Tom of Finland -esque biker with an attitude so icy it could preserve food. The two enter into a full-time power-exchange relationship that fuels both of their desires, until their connection evolves to a heart-wrenching breaking point. Unlike other recent films about kink that were bound and gagged by their own corniness--think and -- has been lauded as realistic, sophisticated, and smart, and the movie is currently sitting at 100 percent on Rotten Tomatoes . Still, was it enough to satisfy senior editor Isabelle Kohn and How to Do It columnist Rich Juzwiak? Be a good boy and find out.

artificial intelligence, colin, ray, (12 more...)

Slate

Country: Europe > Finland (0.24)

Industry:

Media > Film (1.00)
Leisure & Entertainment (1.00)

Technology:

Information Technology > Communications (0.48)
Information Technology > Artificial Intelligence (0.34)

Add feedback

Evaluating Autoformalization Robustness via Semantically Similar Paraphrasing

Moore, Hayden, Shah, Asfahan

arXiv.org Artificial IntelligenceDec-5-2025

Large Language Models (LLMs) have recently emerged as powerful tools for autoformalization. Despite their impressive performance, these models can still struggle to produce grounded and verifiable formalizations. Recent work in text-to-SQL, has revealed that LLMs can be sensitive to paraphrased natural language (NL) inputs, even when high degrees of semantic fidelity are preserved (Safarzadeh, Oroo-jlooyjadid, and Roth 2025). In this paper, we investigate this claim in the autoformalization domain. Specifically, we evaluate the robustness of LLMs generating formal proofs with semantically similar paraphrased NL statements by measuring semantic and compilation validity. Using the formal benchmarks MiniF2F (Zheng, Han, and Polu 2021) and Lean 4 version of ProofNet (Xin et al. 2024), and two modern LLMs, we generate paraphrased natural language statements and cross-evaluate these statements across both models. The results of this paper reveal performance variability across paraphrased inputs, demonstrating that minor shifts in NL statements can significantly impact model outputs.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2511.12784

Genre: Research Report (0.83)

Technology: