Large-Scale Stochastic Sampling from the Probability Simplex

Jack Baker, Paul Fearnhead, Emily Fox, Christopher Nemeth

May-26-2025, 08:18:49 GMT–Neural Information Processing Systems

Stochastic gradient Markov chain Monte Carlo (SGMCMC) has become a popular method for scalable Bayesian inference. These methods are based on sampling a discrete-time approximation to a continuous time process, such as the Langevin diffusion. When applied to distributions defined on a constrained space the timediscretization error can dominate when we are near the boundary of the space. We demonstrate that because of this, current SGMCMC methods for the simplex struggle with sparse simplex spaces; when many of the components are close to zero. Unfortunately, many popular large-scale Bayesian models, such as network or topic models, require inference on sparse simplex spaces.

artificial intelligence, machine learning, sampler, (16 more...)

Neural Information Processing Systems

May-26-2025, 08:18:49 GMT

Conferences PDF

Add feedback

Country:
- North America
  - Canada (0.14)
  - United States (0.14)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning
    - Learning Graphical Models
      - Directed Networks > Bayesian Learning (0.48)
      - Undirected Networks > Markov Models (0.67)
    - Statistical Learning (0.91)
  - Representation & Reasoning > Uncertainty
    - Bayesian Inference (0.86)