AITopics | discrete latent variable

Itcomeswiththeadvantages ofVAEs, such asstable training, largesample diversity and aprincipled inference network, while having the flexibility to model a combination of continuous and discrete generative factors.

artificial intelligence, latent variable, machine learning, (16 more...)

Neural Information Processing Systems

Country:

North America > United States (0.04)
North America > Canada > Quebec > Montreal (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.70)

Add feedback

48cb136b65a69e8c2aa22913a0d91b2f-Paper.pdf

Neural Information Processing SystemsFeb-8-2026, 12:06:50 GMT

facet, international conference, latent variable, (12 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.68)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Data Science > Data Mining (0.94)

Add feedback

Paraphrase Generation with Latent Bag of Words

Neural Information Processing SystemsDec-25-2025, 11:01:39 GMT

Paraphrase generation is a longstanding important problem in natural language processing. Recent progress in deep generative models has shown promising results on discrete latent variables for text generation. Inspired by variational autoencoders with discrete latent structures, in this work, we propose a latent bag of words (BOW) model for paraphrase generation. We ground the semantics of a discrete latent variable by the target BOW. We use this latent variable to build a fully differentiable content planning and surface realization pipeline. Specifically, we use source words to predict their neighbors and model the target BOW with a mixture of softmax.

latent bag, name change, paraphrase generation, (5 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.60)

Add feedback

Bridging Discrete and Backpropagation: Straight-Through and Beyond

Neural Information Processing SystemsDec-24-2025, 07:26:31 GMT

Backpropagation, the cornerstone of deep learning, is limited to computing gradients for continuous variables. This limitation poses challenges for problems involving discrete latent variables. To address this issue, we propose a novel approach to approximate the gradient of parameters involved in generating discrete latent variables. First, we examine the widely used Straight-Through (ST) heuristic and demonstrate that it works as a first-order approximation of the gradient. Guided by our findings, we propose ReinMax, which achieves second-order accuracy by integrating Heun's method, a second-order numerical method for solving ODEs. ReinMax does not require Hessian or other second-order derivatives, thus having negligible computation overheads. Extensive experimental results on various tasks demonstrate the superiority of ReinMax over the state of the art.

bridging discrete and backpropagation, discrete latent variable, name change, (4 more...)

Neural Information Processing Systems

Genre: Research Report (0.62)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.66)

Add feedback

Neural Discrete Representation Learning

Aaron van den Oord, Oriol Vinyals, koray kavukcuoglu

Neural Information Processing SystemsNov-21-2025, 11:16:58 GMT

Learning useful representations without supervision remains a key challenge in machine learning.

artificial intelligence, arxiv preprint arxiv, machine learning, (16 more...)

Neural Information Processing Systems

Country: North America > United States > California > Los Angeles County > Long Beach (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Vision (0.94)

Add feedback

Learning Disentangled Joint Continuous and Discrete Representations

Emilien Dupont

Neural Information Processing SystemsNov-20-2025, 19:36:36 GMT

We present a framework for learning disentangled and interpretable jointly continuous and discrete representations in an unsupervised manner.

artificial intelligence, latent variable, machine learning, (18 more...)

Neural Information Processing Systems

Country:

North America > United States > California > San Mateo County > Menlo Park (0.04)
North America > Canada > Quebec > Montreal (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

GumBolt: Extending Gumbel trick to Boltzmann priors

Amir H. Khoshaman, Mohammad Amin

Neural Information Processing SystemsNov-20-2025, 18:46:32 GMT

Boltzmann machines (BMs) are appealing candidates for powerful priors in varia-tional autoencoders (V AEs), as they are capable of capturing nontrivial and multi-modal distributions over discrete variables.

artificial intelligence, arxiv preprint arxiv, machine learning, (16 more...)

Neural Information Processing Systems

Country: North America > Canada > Quebec > Montreal (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

dcc337bb2a4d25afefd9ab800721debb-Paper-Conference.pdf

Neural Information Processing SystemsSep-29-2025, 00:47:39 GMT

artificial intelligence, deep learning, machine learning, (18 more...)

Neural Information Processing Systems

Country:

Europe (0.14)
North America > United States > California (0.14)

Industry: Energy > Oil & Gas (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.40)

Add feedback

Distillation of a tractable model from the VQ-VAE

Hadžić, Armin, Papez, Milan, Pevný, Tomáš

arXiv.org Artificial IntelligenceSep-3-2025

Deep generative models with discrete latent space, such as the Vector-Quantized Variational Autoencoder (VQ-VAE), offer excellent data generation capabilities, but, due to the large size of their latent space, their probabilistic inference is deemed intractable. We demonstrate that the VQ-VAE can be distilled into a tractable model by selecting a subset of latent variables with high probabilities. This simple strategy is particularly efficient, especially if the VQ-VAE underutilizes its latent space, which is, indeed, very often the case. We frame the distilled model as a probabilistic circuit, and show that it preserves expressiveness of the VQ-VAE while providing tractable probabilistic inference. Experiments illustrate competitive performance in density estimation and conditional generation tasks, challenging the view of the VQ-VAE as an inherently intractable model.

artificial intelligence, latent space, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2509.014

Country: Europe > Czechia (0.14)

Genre: Research Report (1.00)

Technology: