Evidential Softmax for Sparse Multimodal Distributions in Deep Generative Models

Oct-10-2024, 17:27:01 GMT–Neural Information Processing Systems

Many applications of generative models rely on the marginalization of their high-dimensional output probability distributions. Normalization functions that yield sparse probability distributions can make exact marginalization more computationally tractable. However, sparse normalization functions usually require alternative loss functions for training, since the log-likelihood is undefined for sparse probability distributions. In this work, we present ev-softmax, a sparse normalization function that preserves the multimodality of probability distributions. We derive its properties, including its gradient in closed form, and introduce a continuous family of approximations to ev-softmax that have full support and can be trained with probabilistic loss functions such as negative log-likelihood and Kullback-Leibler divergence.
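The training difficulty described above can be illustrated with a small sketch. The abstract does not say how ev-softmax is defined, so the code below uses a hypothetical mean-threshold rule (keep only logits at or above the mean logit) purely as a stand-in for a sparse normalization function; the names `mean_threshold_sparse`, `nll`, and the smoothing parameter `eps` are assumptions made for this example. It shows that a sparse output assigns zero probability to some classes, which makes the negative log-likelihood infinite there, while a small `eps` restores full support and a finite loss.

```python
import math


def mean_threshold_sparse(logits, eps=0.0):
    """Hypothetical sparse normalizer (illustration only, not the paper's definition).

    Each logit gets weight 1 if it is at or above the mean logit and 0 otherwise;
    eps > 0 adds a small weight to every entry so the output has full support.
    """
    mean = sum(logits) / len(logits)
    peak = max(logits)  # subtract the max before exponentiating, for numerical stability
    weighted = [
        ((1.0 if v >= mean else 0.0) + eps) * math.exp(v - peak)
        for v in logits
    ]
    total = sum(weighted)  # always positive: the largest logit is never below the mean
    return [w / total for w in weighted]


def nll(probs, target):
    """Negative log-likelihood of one target class; infinite when its probability is 0."""
    p = probs[target]
    return math.inf if p == 0.0 else -math.log(p)


# Two nearly tied modes (indices 0 and 1) and two unlikely classes (indices 2 and 3).
logits = [2.0, 1.9, -1.0, -3.0]

sparse = mean_threshold_sparse(logits)            # exact zeros for indices 2 and 3
smooth = mean_threshold_sparse(logits, eps=1e-3)  # full support

print("sparse:", sparse, "NLL for class 2:", nll(sparse, 2))
print("smooth:", smooth, "NLL for class 2:", nll(smooth, 2))
```

In this sketch both modes keep nonzero probability under the sparse rule, and only the smoothed variant yields a finite loss for a class that was zeroed out, which is the motivation the abstract gives for a full-support family of approximations.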

deep generative model, probability distribution, sparse multimodal distribution


Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Generation (0.71)
  - Machine Learning
    - Statistical Learning (0.96)
    - Neural Networks > Deep Learning
      - Generative AI (0.40)