AITopics | Materials

Materials

Constrained Decoding of Diffusion LLMs with Context-Free Grammars

Mündler, Niels, Dekoninck, Jasper, Vechev, Martin

arXiv.org Artificial Intelligence, Aug-18-2025

Large language models (LLMs) have shown promising performance across diverse domains. Many practical applications of LLMs, such as code completion and structured data extraction, require adherence to syntactic constraints specified by a formal language. Yet, due to their probabilistic nature, LLM output is not guaranteed to adhere to such formal languages. Prior work has proposed constrained decoding as a means to restrict LLM generation to particular formal languages. However, existing works are not applicable to the emerging paradigm of diffusion LLMs, when used in practical scenarios such as the generation of formally correct C++ or JSON output. In this paper we address this challenge and present the first constrained decoding method for diffusion models, one that can handle formal languages captured by context-free grammars. We begin by reducing constrained decoding to the more general additive infilling problem, which asks whether a partial output can be completed to a valid word in the target language. This problem also naturally subsumes the previously unaddressed multi-region infilling constrained decoding. We then reduce this problem to the task of deciding whether the intersection of the target language and a regular language is empty and present an efficient algorithm to solve it for context-free languages. Empirical results on various applications, such as C++ code infilling and structured data extraction in JSON, demonstrate that our method achieves near-perfect syntactic correctness while consistently preserving or improving functional correctness. Importantly, our efficiency optimizations ensure that the computational overhead remains practical.
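
The reduction described in the abstract, deciding whether the intersection of a context-free language and a regular language is empty, can be illustrated with a small sketch. The code below is not the authors' algorithm or its optimized form; it is a textbook-style fixpoint in the spirit of the Bar-Hillel product construction, and it works on characters rather than LLM tokens. The names `HOLE`, `pattern_to_nfa`, `compose`, `can_complete`, and `BALANCED` are hypothetical and chosen only for illustration.

```python
HOLE = None  # hypothetical marker for a region that is still to be filled in


def pattern_to_nfa(pattern, alphabet):
    """Build an NFA for a partial output: literal segments plus HOLEs (any text)."""
    delta = {}  # (state, char) -> set of next states
    state = 0
    for segment in pattern:
        if segment is HOLE:
            # A hole may absorb any string: self-loop on every character.
            for ch in alphabet:
                delta.setdefault((state, ch), set()).add(state)
        else:
            # A literal segment must be read character by character.
            for ch in segment:
                delta.setdefault((state, ch), set()).add(state + 1)
                state += 1
    return delta, state + 1, state  # transitions, state count, accepting state


def compose(r1, r2):
    """Relational composition: (p, r) whenever (p, q) in r1 and (q, r) in r2."""
    by_start = {}
    for p, q in r2:
        by_start.setdefault(p, set()).add(q)
    return {(p, r) for p, q in r1 for r in by_start.get(q, ())}


def can_complete(grammar, start, pattern, alphabet):
    """True iff some word of the grammar matches the pattern (non-empty intersection).

    Assumes nonterminal names never coincide with alphabet characters.
    """
    delta, n_states, accept = pattern_to_nfa(pattern, alphabet)
    # reach[X] holds the state pairs (p, q) such that X derives a string
    # driving the NFA from p to q.
    reach = {nt: set() for nt in grammar}
    for (p, ch), targets in delta.items():
        reach.setdefault(ch, set()).update((p, q) for q in targets)
    identity = {(s, s) for s in range(n_states)}
    changed = True
    while changed:  # monotone fixpoint over finitely many state pairs
        changed = False
        for nt, productions in grammar.items():
            for rhs in productions:
                pairs = identity  # an empty right-hand side leaves the state unchanged
                for sym in rhs:
                    pairs = compose(pairs, reach.get(sym, set()))
                if not pairs <= reach[nt]:
                    reach[nt] |= pairs
                    changed = True
    return (0, accept) in reach[start]


# Balanced parentheses: S -> "" | "(" S ")" S
BALANCED = {"S": [(), ("(", "S", ")", "S")]}
```

For example, `can_complete(BALANCED, "S", ["((", HOLE, ")"], "()")` asks whether a partial output with one open region can still be completed to a balanced string. Listing several `HOLE` entries in the pattern models the multi-region infilling case mentioned in the abstract.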

completion, large language model, machine learning, (20 more...)

2508.10111

Country: Europe > Switzerland (0.28)

Genre: Research Report (0.64)

Industry: Materials > Chemicals > Commodity Chemicals (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

a9b3d7f65eebb083e5c7f8cf10e52528-Paper-Datasets_and_Benchmarks.pdf

Neural Information Processing Systems, Aug-17-2025, 13:04:02 GMT

machine learning, reinforcement, reinforcement learning, (16 more...)

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > Canada > Alberta (0.14)
Asia > Middle East > Jordan (0.04)
(4 more...)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Energy (1.00)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.94)
(4 more...)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

A More Experimental Results of Empirical Exploration

Neural Information Processing Systems, Aug-17-2025, 11:16:11 GMT

These observations suggest the existence of a tradeoff between average robustness and robust fairness. We use var and rob.acc to denote the variance of class-wise robust accuracy and the average robust accuracy, respectively. B.1 Naturally Trained Linear Model: For any classifier f(x) in Equation (2), we first calculate its natural risk.
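
The two quantities named above, var and rob.acc, can be computed from per-example results as in the following minimal sketch. The excerpt does not say whether the variance is a population or sample variance, or whether the average is taken over examples or over classes; the code assumes population variance across classes and an average over examples. The function name `classwise_robust_stats` and its arguments are hypothetical.

```python
from statistics import mean, pvariance


def classwise_robust_stats(labels, robust_correct):
    """Return (rob_acc, var, class_acc) from per-example robustness outcomes.

    labels:         true class of each example
    robust_correct: True if the example is still classified correctly under attack
    Assumption: var is the population variance of the class-wise accuracies,
    and rob_acc is averaged over examples.
    """
    per_class = {}
    for y, ok in zip(labels, robust_correct):
        per_class.setdefault(y, []).append(1.0 if ok else 0.0)
    # Robust accuracy of each class separately.
    class_acc = {y: mean(outcomes) for y, outcomes in per_class.items()}
    # Average robust accuracy over all examples.
    rob_acc = mean(1.0 if ok else 0.0 for ok in robust_correct)
    # Spread of robust accuracy across classes (lower means fairer).
    var = pvariance(list(class_acc.values()))
    return rob_acc, var, class_acc
```

A model can score well on rob.acc while var stays large, which is the kind of tradeoff between average robustness and robust fairness that the excerpt refers to.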

artificial intelligence, machine learning, robust accuracy, (15 more...)

Industry:

Materials > Chemicals > Industrial Gases > Liquified Gas (0.50)
Materials > Chemicals > Commodity Chemicals > Petrochemicals > LNG (0.50)
Energy > Oil & Gas > Midstream (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.49)

9c7008aff45b5d8f0973b23e1a22ada0-Paper-Conference.pdf

Neural Information Processing Systems, Aug-17-2025, 06:10:54 GMT

arxiv preprint arxiv, machine learning, reinforcement learning, (17 more...)

Country:

North America > Canada > British Columbia (0.04)
Asia > Japan > Honshū > Chūbu > Ishikawa Prefecture > Kanazawa (0.04)

Genre: Research Report > New Finding (0.46)

Industry:

Materials (0.93)
Leisure & Entertainment > Games > Computer Games (0.49)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
(2 more...)

b21f9f98829dea9a48fd8aaddc1f159d-Supplemental.pdf

Neural Information Processing Systems, Aug-16-2025, 22:41:52 GMT

artificial intelligence, dataset, machine learning, (18 more...)

Country:

North America > United States > New York (0.04)
North America > United States > New Jersey (0.04)
Asia > Japan (0.04)

Genre: Research Report (0.46)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (0.93)
Materials > Chemicals (0.93)
Health & Medicine > Therapeutic Area > Immunology (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.49)

Likelihood-Based Diffusion Language Models

Neural Information Processing Systems, Aug-16-2025, 10:01:51 GMT

Large language models lie at the center of recent advances in artificial intelligence.

large language model, machine learning, natural language, (16 more...)

Country:

Europe > United Kingdom (0.67)
Asia (0.45)
Europe > France (0.28)
(6 more...)

Genre: Research Report > New Finding (0.92)

Industry:

Media (1.00)
Leisure & Entertainment > Sports (1.00)
Law > Statutes (1.00)
(12 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

317470b3fde29f3bb8d6dee563afffc4-Paper-Conference.pdf

Neural Information Processing Systems, Aug-16-2025, 00:04:04 GMT

artificial intelligence, machine learning, mesh, (16 more...)

Country: North America > United States > Texas (0.14)

Genre: Research Report > New Finding (0.46)

Industry:

Materials > Chemicals (0.47)
Energy > Oil & Gas (0.46)
Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)
Information Technology > Data Science (0.68)

25cd345233c65fac1fec0ce61d0f7836-Supplemental-Datasets_and_Benchmarks_Track.pdf

Neural Information Processing Systems, Aug-15-2025, 23:45:04 GMT

artificial intelligence, deep learning, machine learning, (18 more...)

Country: North America > United States (0.68)

Genre: Research Report > Experimental Study (0.46)

Industry:

Materials > Chemicals > Industrial Gases > Liquified Gas (1.00)
Materials > Chemicals > Commodity Chemicals > Petrochemicals > LNG (1.00)
Energy > Oil & Gas > Midstream (1.00)
Health & Medicine (0.68)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.83)

8466f9ace6a9acbe71f75762ffc890f1-Paper.pdf

Neural Information Processing Systems, Aug-15-2025, 14:39:08 GMT

machine learning, natural language, translation, (20 more...)

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Spain (0.04)
North America > United States > New York > New York County > New York City (0.04)
(2 more...)

Genre: Research Report > New Finding (0.67)

Industry:

Materials > Metals & Mining (0.46)
Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Appendix A Patch based Negative Data Augmentation Reduces Texture Bias

Neural Information Processing Systems, Aug-15-2025, 12:23:12 GMT

Figure 5: ViTs trained only on our patch-based transformations exhibit stronger texture bias. Each bar is the texture accuracy (%) on Conflict Stimuli (Geirhos et al., 2018); a higher texture accuracy indicates that the model is more biased towards texture. The "texture accuracy" is defined as the percentage of images that are classified as the "texture" label, provided the image is classified as either the "texture" or the "shape" label. The baseline model is the ViT-B/16 of Dosovitskiy et al. (2021), trained on original images. The other models are trained on patch-based transformed images, e.g., "P-Shuffle" stands for a ViT-B/16 model trained on patch-based shuffled images.
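
The definition of texture accuracy given in the caption can be written out as a short sketch. It assumes each conflict image comes with one texture label and one different shape label, and that predictions are already mapped to the same label set; the function name `texture_accuracy` and its arguments are hypothetical.

```python
def texture_accuracy(predictions, texture_labels, shape_labels):
    """Percentage of texture decisions among images classified as texture or shape.

    Images predicted as neither their texture nor their shape label are ignored,
    following the "provided the image is classified as either" condition.
    Returns None when no image falls into either category.
    """
    texture_hits = 0
    shape_hits = 0
    for pred, texture, shape in zip(predictions, texture_labels, shape_labels):
        if pred == texture:
            texture_hits += 1
        elif pred == shape:
            shape_hits += 1
    decided = texture_hits + shape_hits
    if decided == 0:
        return None
    return 100.0 * texture_hits / decided
```

Under this definition, a value of 50 would mean texture and shape decisions are equally frequent, and values above 50 indicate a bias towards texture.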

artificial intelligence, machine learning, vit-b 16, (14 more...)

Industry:

Materials > Chemicals > Industrial Gases > Liquified Gas (0.31)
Materials > Chemicals > Commodity Chemicals > Petrochemicals > LNG (0.31)
Energy > Oil & Gas > Midstream (0.31)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.33)
