AITopics | multinomial diffusion

Autoregressive models and their sequential factorization of the data likelihood have recently demonstrated great potential for image representation and synthesis. Nevertheless, they incorporate image context in a linear 1D order by attending only to previously synthesized image patches above or to the left. Not only is this unidirectional, sequential bias of attention unnatural for images as it disregards large parts of a scene until synthesis is almost complete. It also processes the entire image on a single scale, thus ignoring more global contextual information up to the gist of the entire scene. As a remedy we incorporate a coarse-to-fine hierarchy of context by combining the autoregressive formulation with a multinomial diffusion process: Whereas a multistage diffusion process successively compresses and removes information to coarsen an image, we train a Markov chain to invert this process. In each stage, the resulting autoregressive ImageBART model progressively incorporates context from previous stages in a coarse-to-fine manner. Experiments demonstrate the gain over current autoregressive models, continuous diffusion probabilistic models, and latent variable models. Moreover, the approach enables to control the synthesis process and to trade compression rate against reconstruction accuracy, while still guaranteeing visually plausible results.

autoregressive image synthesis, bidirectional context, multinomial diffusion, (5 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.60)

Add feedback

Fast Sampling via Discrete Non-Markov Diffusion Models with Predetermined Transition Time

Neural Information Processing SystemsOct-10-2025, 15:38:24 GMT

Diffusion-based generative models, as first introduced by Sohl-Dickstein et al. (2015), have shown

diffusion, diffusion model, transition time, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.28)
Asia > Middle East > Oman (0.04)
North America > United States > Maryland > Baltimore (0.04)
Europe > Germany > Berlin (0.04)

Genre:

Research Report > Experimental Study (0.92)
Research Report > New Finding (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.45)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.45)

Add feedback

67d96d458abdef21792e6d8e590244e7-Paper.pdf

Neural Information Processing SystemsAug-14-2025, 22:33:45 GMT

argmax flow, diffusion model, international conference, (11 more...)

Neural Information Processing Systems

Country:

North America > Mexico > Mexico City > Mexico City (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)
Europe > Denmark (0.04)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)

Add feedback

ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis

Neural Information Processing SystemsOct-9-2024, 16:55:43 GMT

Autoregressive models and their sequential factorization of the data likelihood have recently demonstrated great potential for image representation and synthesis. Nevertheless, they incorporate image context in a linear 1D order by attending only to previously synthesized image patches above or to the left. Not only is this unidirectional, sequential bias of attention unnatural for images as it disregards large parts of a scene until synthesis is almost complete. It also processes the entire image on a single scale, thus ignoring more global contextual information up to the gist of the entire scene. As a remedy we incorporate a coarse-to-fine hierarchy of context by combining the autoregressive formulation with a multinomial diffusion process: Whereas a multistage diffusion process successively compresses and removes information to coarsen an image, we train a Markov chain to invert this process. In each stage, the resulting autoregressive ImageBART model progressively incorporates context from previous stages in a coarse-to-fine manner.

autoregressive image synthesis, bidirectional context, multinomial diffusion, (2 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.43)

Add feedback

Fast Sampling via De-randomization for Discrete Diffusion Models

Chen, Zixiang, Yuan, Huizhuo, Li, Yongqian, Kou, Yiwen, Zhang, Junkai, Gu, Quanquan

arXiv.org Machine LearningDec-14-2023

Diffusion models have emerged as powerful tools for high-quality data generation, such as image generation. Despite its success in continuous spaces, discrete diffusion models, which apply to domains such as texts and natural languages, remain under-studied and often suffer from slow generation speed. In this paper, we propose a novel de-randomized diffusion process, which leads to an accelerated algorithm for discrete diffusion models. Our technique significantly reduces the number of function evaluations (i.e., calls to the neural network), making the sampling process much faster. Furthermore, we introduce a continuous-time (i.e., infinite-step) sampling algorithm that can provide even better sample qualities than its discrete-time (finite-step) counterpart. Extensive experiments on natural language generation and machine translation tasks demonstrate the superior performance of our method in terms of both generation speed and sample quality over existing methods for discrete diffusion models.

diffusion model, machine learning, natural language, (19 more...)

arXiv.org Machine Learning

2312.09193

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.28)
North America > United States > Maryland > Baltimore (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
(2 more...)

Genre: Research Report > New Finding (0.45)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.88)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Add feedback

Argmax Flows and Multinomial Diffusion: Towards Non-Autoregressive Language Models

Hoogeboom, Emiel, Nielsen, Didrik, Jaini, Priyank, Forré, Patrick, Welling, Max

arXiv.org Machine LearningFeb-10-2021

The field of language modelling has been largely dominated by autoregressive models, for which sampling is inherently difficult to parallelize. This paper introduces two new classes of generative models for categorical data such as language or image segmentation: Argmax Flows and Multinomial Diffusion. Argmax Flows are defined by a composition of a continuous distribution (such as a normalizing flow), and an argmax function. To optimize this model, we learn a probabilistic inverse for the argmax that lifts the categorical data to a continuous space. Multinomial Diffusion gradually adds categorical noise in a diffusion process, for which the generative denoising process is learned. We demonstrate that our models perform competitively on language modelling and modelling of image segmentation maps.

argmax flow, diffusion, multinomial diffusion, (13 more...)

arXiv.org Machine Learning

2102.05379

Country:

North America > Mexico > Mexico City > Mexico City (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)
Europe > Spain (0.04)
Europe > Denmark (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Filters

Collaborating Authors

multinomial diffusion

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Fast Sampling via Discrete Non-Markov Diffusion Models with Predetermined Transition Time

67d96d458abdef21792e6d8e590244e7-Paper.pdf

ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis

Fast Sampling via Discrete Non-Markov Diffusion Models with Predetermined Transition Time

67d96d458abdef21792e6d8e590244e7-Paper.pdf

ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis

Fast Sampling via De-randomization for Discrete Diffusion Models

Argmax Flows and Multinomial Diffusion: Towards Non-Autoregressive Language Models