AITopics | conditional variational autoencoder

Learning Manifold Dimensions with Conditional Variational Autoencoders

Neural Information Processing SystemsDec-25-2025, 12:46:02 GMT

Although the variational autoencoder (VAE) and its conditional extension (CVAE) are capable of state-of-the-art results across multiple domains, their precise behavior is still not fully understood, particularly in the context of data (like images) that lie on or near a low-dimensional manifold. For example, while prior work has suggested that the globally optimal VAE solution can learn the correct manifold dimension, a necessary (but not sufficient) condition for producing samples from the true data distribution, this has never been rigorously proven. Moreover, it remains unclear how such considerations would change when various types of conditioning variables are introduced, or when the data support is extended to a union of manifolds (e.g., as is likely the case for MNIST digits and related). In this work, we address these points by first proving that VAE global minima are indeed capable of recovering the correct manifold dimension. We then extend this result to more general CVAEs, demonstrating practical scenarios whereby the conditioning variables allow the model to adaptively learn manifolds of varying dimension across samples. Our analyses, which have practical implications for various CVAE design choices, are also supported by numerical results on both synthetic and real-world datasets.

conditional variational autoencoder, learning manifold dimension, manifold dimension, (4 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.43)

Add feedback

Evidential Sparsification of Multimodal Latent Spaces in Conditional Variational Autoencoders

Neural Information Processing SystemsDec-24-2025, 04:38:16 GMT

Discrete latent spaces in variational autoencoders have been shown to effectively capture the data distribution for many real-world problems such as natural language understanding, human intent prediction, and visual scene representation. However, discrete latent spaces need to be sufficiently large to capture the complexities of real-world data, rendering downstream tasks computationally challenging. For instance, performing motion planning in a high-dimensional latent representation of the environment could be intractable. We consider the problem of sparsifying the discrete latent space of a trained conditional variational autoencoder, while preserving its learned multimodality. As a post hoc latent space reduction technique, we use evidential theory to identify the latent classes that receive direct evidence from a particular input condition and filter out those that do not. Experiments on diverse tasks, such as image generation and human behavior prediction, demonstrate the effectiveness of our proposed technique at reducing the discrete latent sample space size of a model while maintaining its learned multimodality.

conditional variational autoencoder, evidential sparsification, multimodal latent space, (7 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Natural Language (0.98)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.88)

Add feedback

Deep Learning-Driven Prediction of Microstructure Evolution via Latent Space Interpolation

Gaikwad, Sachin, Kasilingam, Thejas, Ahmad, Owais, Mukherjee, Rajdip, Bhowmick, Somnath

arXiv.org Artificial IntelligenceOct-30-2025

Phase-field models accurately simulate microstructure evolution, but their dependence on solving complex differential equations makes them computationally expensive. This work achieves a significant acceleration via a novel deep learning-based framework, utilizing a Conditional Variational Autoencoder (CVAE) coupled with Cubic Spline Interpolation and Spherical Linear Interpolation (SLERP). We demonstrate the method for binary spinodal decomposition by predicting microstructure evolution for intermediate alloy compositions from a limited set of training compositions. First, using microstructures from phase-field simulations of binary spinodal decomposition, we train the CVAE, which learns compact latent representations that encode essential morphological features. Next, we use cubic spline interpolation in the latent space to predict microstructures for any unknown composition. Finally, SLERP ensures smooth morphological evolution with time that closely resembles coarsening. The predicted microstructures exhibit high visual and statistical similarity to phase-field simulations. This framework offers a scalable and efficient surrogate model for microstructure evolution, enabling accelerated materials design and composition optimization.

artificial intelligence, deep learning, machine learning, (15 more...)

arXiv.org Artificial Intelligence

doi: 10.1103/5ngk-4v9j

2508.01822

Country:

Asia > India (0.14)
North America > United States > New York > New York County > New York City (0.04)
North America > Canada > Alberta > Census Division No. 15 > Improvement District No. 9 > Banff (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Evidential Sparsification of Multimodal Latent Spaces in Conditional Variational Autoencoders

Neural Information Processing SystemsMay-27-2025, 03:12:47 GMT

Discrete latent spaces in variational autoencoders have been shown to effectively capture the data distribution for many real-world problems such as natural language understanding, human intent prediction, and visual scene representation. However, discrete latent spaces need to be sufficiently large to capture the complexities of real-world data, rendering downstream tasks computationally challenging. For instance, performing motion planning in a high-dimensional latent representation of the environment could be intractable. We consider the problem of sparsifying the discrete latent space of a trained conditional variational autoencoder, while preserving its learned multimodality. As a post hoc latent space reduction technique, we use evidential theory to identify the latent classes that receive direct evidence from a particular input condition and filter out those that do not. Experiments on diverse tasks, such as image generation and human behavior prediction, demonstrate the effectiveness of our proposed technique at reducing the discrete latent sample space size of a model while maintaining its learned multimodality.

artificial intelligence, machine learning, natural language, (7 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Review for NeurIPS paper: Evidential Sparsification of Multimodal Latent Spaces in Conditional Variational Autoencoders

Neural Information Processing SystemsJan-25-2025, 19:32:20 GMT

Weaknesses: I find three points of weakness that decrease the potential impact of the work: i) References are too focused on "application" papers and evidential theory, while authors want to present a new methodology for reducing the discrete latent space dimensionality in auto-encoders. Well, if authors include more references or comments about theoretical papers of VAEs, this work could be better contrasted with other similar works, and will potentially facilitate its disclosure.. ii) Apart from the references, authors fail on the fact of not including a short paragraph or subsection about the CVAE with a few details to refresh the ideas and having a work that is totally self-contained. They could have sacrificed half-page of experiments to described the conditional auto-encoder better. So, if the number 9 was badly compressed in the latent space, and then so many other dimensions removed, after re-normalising, the number 9 gets importance? is that what is happening? The other question is about Table 1 and the accuracy performance under the 50% in classification, pretty bad, right?

conditional variational autoencoder, evidential sparsification, multimodal latent space, (3 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.97)

Add feedback

Review for NeurIPS paper: Evidential Sparsification of Multimodal Latent Spaces in Conditional Variational Autoencoders

Neural Information Processing SystemsJan-25-2025, 19:32:12 GMT

All the reviewers were relatively positive about this paper but there were some concerns, mainly with regards to details of the experimental comparison and related work. These have been clarified during the rebuttal and the reviewers were happy to recommend acceptance.

conditional variational autoencoder, evidential sparsification, multimodal latent space, (2 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.40)

Add feedback

Learning Manifold Dimensions with Conditional Variational Autoencoders

Neural Information Processing SystemsJan-19-2025, 02:51:05 GMT

Although the variational autoencoder (VAE) and its conditional extension (CVAE) are capable of state-of-the-art results across multiple domains, their precise behavior is still not fully understood, particularly in the context of data (like images) that lie on or near a low-dimensional manifold. For example, while prior work has suggested that the globally optimal VAE solution can learn the correct manifold dimension, a necessary (but not sufficient) condition for producing samples from the true data distribution, this has never been rigorously proven. Moreover, it remains unclear how such considerations would change when various types of conditioning variables are introduced, or when the data support is extended to a union of manifolds (e.g., as is likely the case for MNIST digits and related). In this work, we address these points by first proving that VAE global minima are indeed capable of recovering the correct manifold dimension. We then extend this result to more general CVAEs, demonstrating practical scenarios whereby the conditioning variables allow the model to adaptively learn manifolds of varying dimension across samples.

conditional variational autoencoder, correct manifold dimension, learning manifold dimension, (1 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.64)

Add feedback

CRepair: CVAE-based Automatic Vulnerability Repair Technology

Liu, Penghui, Bi, Yingzhou, Huang, Jiangtao, Jiang, Xinxin, Wang, Lianmei

arXiv.org Artificial IntelligenceNov-8-2024

Software vulnerabilities are flaws in computer software systems that pose significant threats to the integrity, security, and reliability of modern software and its application data. These vulnerabilities can lead to substantial economic losses across various industries. Manual vulnerability repair is not only time-consuming but also prone to errors. To address the challenges of vulnerability repair, researchers have proposed various solutions, with learning-based automatic vulnerability repair techniques gaining widespread attention. However, existing methods often focus on learning more vulnerability data to improve repair outcomes, while neglecting the diverse characteristics of vulnerable code, and suffer from imprecise vulnerability localization.To address these shortcomings, this paper proposes CRepair, a CVAE-based automatic vulnerability repair technology aimed at fixing security vulnerabilities in system code. We first preprocess the vulnerability data using a prompt-based method to serve as input to the model. Then, we apply causal inference techniques to map the vulnerability feature data to probability distributions. By employing multi-sample feature fusion, we capture diverse vulnerability feature information. Finally, conditional control is used to guide the model in repairing the vulnerabilities.Experimental results demonstrate that the proposed method significantly outperforms other benchmark models, achieving a perfect repair rate of 52%. The effectiveness of the approach is validated from multiple perspectives, advancing AI-driven code vulnerability repair and showing promising applications.

data mining, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2411.0554

Country: Asia > China > Guangxi Province > Nanning (0.04)

Genre: Research Report > New Finding (0.48)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(3 more...)

Add feedback

Univariate Conditional Variational Autoencoder for Morphogenic Patterns Design in Frontal Polymerization-Based Manufacturing

Liu, Qibang, Cai, Pengfei, Abueidda, Diab, Vyas, Sagar, Koric, Seid, Gomez-Bombarelli, Rafael, Geubelle, Philippe

arXiv.org Artificial IntelligenceOct-31-2024

Under some initial and boundary conditions, the rapid reaction-thermal diffusion process taking place during frontal polymerization (FP) destabilizes the planar mode of front propagation, leading to spatially varying, complex hierarchical patterns in thermoset polymeric materials. Although modern reaction-diffusion models can predict the patterns resulting from unstable FP, the inverse design of patterns, which aims to retrieve process conditions that produce a desired pattern, remains an open challenge due to the non-unique and non-intuitive mapping between process conditions and manufactured patterns. In this work, we propose a probabilistic generative model named univariate conditional variational autoencoder (UcVAE) for the inverse design of hierarchical patterns in FP-based manufacturing. Unlike the cVAE, which encodes both the design space and the design target, the UcVAE encodes only the design space. In the encoder of the UcVAE, the number of training parameters is significantly reduced compared to the cVAE, resulting in a shorter training time while maintaining comparable performance. Given desired pattern images, the trained UcVAE can generate multiple process condition solutions that produce high-fidelity hierarchical patterns.

artificial intelligence, machine learning, process condition, (18 more...)

arXiv.org Artificial Intelligence

2410.17518

Country:

North America > United States > Illinois (0.16)
Europe > Netherlands (0.14)
Europe > Germany (0.14)
Asia > Middle East > UAE (0.14)

Genre: Research Report (0.82)

Industry:

Government > Regional Government > North America Government > United States Government (0.67)
Energy > Oil & Gas > Downstream (0.63)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Evidential Sparsification of Multimodal Latent Spaces in Conditional Variational Autoencoders

Neural Information Processing SystemsOct-10-2024, 13:19:15 GMT

Discrete latent spaces in variational autoencoders have been shown to effectively capture the data distribution for many real-world problems such as natural language understanding, human intent prediction, and visual scene representation. However, discrete latent spaces need to be sufficiently large to capture the complexities of real-world data, rendering downstream tasks computationally challenging. For instance, performing motion planning in a high-dimensional latent representation of the environment could be intractable. We consider the problem of sparsifying the discrete latent space of a trained conditional variational autoencoder, while preserving its learned multimodality. As a post hoc latent space reduction technique, we use evidential theory to identify the latent classes that receive direct evidence from a particular input condition and filter out those that do not. Experiments on diverse tasks, such as image generation and human behavior prediction, demonstrate the effectiveness of our proposed technique at reducing the discrete latent sample space size of a model while maintaining its learned multimodality.

conditional variational autoencoder, evidential sparsification, multimodal latent space, (4 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Collaborating Authors

conditional variational autoencoder

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Learning Manifold Dimensions with Conditional Variational Autoencoders

Evidential Sparsification of Multimodal Latent Spaces in Conditional Variational Autoencoders

Deep Learning-Driven Prediction of Microstructure Evolution via Latent Space Interpolation

Evidential Sparsification of Multimodal Latent Spaces in Conditional Variational Autoencoders

Review for NeurIPS paper: Evidential Sparsification of Multimodal Latent Spaces in Conditional Variational Autoencoders

Review for NeurIPS paper: Evidential Sparsification of Multimodal Latent Spaces in Conditional Variational Autoencoders

Learning Manifold Dimensions with Conditional Variational Autoencoders

CRepair: CVAE-based Automatic Vulnerability Repair Technology

Univariate Conditional Variational Autoencoder for Morphogenic Patterns Design in Frontal Polymerization-Based Manufacturing

Evidential Sparsification of Multimodal Latent Spaces in Conditional Variational Autoencoders