Differentiable Sampling of Categorical Distributions Using the CatLog-Derivative Trick

Jan-18-2025, 19:03:08 GMT – Neural Information Processing Systems

Categorical random variables can faithfully represent the discrete and uncertain aspects of data as part of a discrete latent variable model. Learning in such models necessitates taking gradients with respect to the parameters of the categorical probability distributions, which is often intractable due to their combinatorial nature. A popular technique to estimate these otherwise intractable gradients is the Log-Derivative trick. This trick forms the basis of the well-known REINFORCE gradient estimator and its many extensions. While the Log-Derivative trick allows us to differentiate through samples drawn from categorical distributions, it does not take into account the discrete nature of the distribution itself.
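To make the Log-Derivative trick mentioned above concrete, here is a minimal sketch of the standard REINFORCE (score-function) estimator for a single categorical variable parameterized by softmax logits. It is not the CatLog-Derivative trick from the paper. The function names, the test function values, and the sample count are illustrative assumptions. The sketch relies on the identity that the gradient of E[f(x)] with respect to the logits equals E[f(x) ∇ log p(x)], where for a softmax the gradient of log p(x) is one_hot(x) − p.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(l - m) for l in logits]
    total = sum(exps)
    return [e / total for e in exps]


def exact_gradient(logits, f_values):
    """Closed-form gradient of E_{x~p}[f(x)] w.r.t. the logits.

    For p = softmax(logits): dE/dlogit_j = p_j * (f_j - E[f]).
    Only tractable here because there is a single small variable.
    """
    p = softmax(logits)
    mean_f = sum(pi * fi for pi, fi in zip(p, f_values))
    return [pi * (fi - mean_f) for pi, fi in zip(p, f_values)]


def reinforce_gradient(logits, f_values, num_samples, rng):
    """Monte Carlo estimate using the Log-Derivative trick (REINFORCE).

    Averages f(x) * grad log p(x) over samples x ~ p, where
    grad log p(x) w.r.t. the logits is one_hot(x) - p.
    """
    p = softmax(logits)
    k = len(p)
    samples = rng.choices(range(k), weights=p, k=num_samples)
    grad = [0.0] * k
    for x in samples:
        for j in range(k):
            score_j = (1.0 if j == x else 0.0) - p[j]
            grad[j] += f_values[x] * score_j
    return [g / num_samples for g in grad]


if __name__ == "__main__":
    # Hypothetical logits and function values, chosen only for illustration.
    logits = [0.5, -1.0, 2.0]
    f_values = [1.0, 4.0, -2.0]
    rng = random.Random(0)
    print("exact    :", exact_gradient(logits, f_values))
    print("REINFORCE:", reinforce_gradient(logits, f_values, 100_000, rng))
```

With enough samples the estimate approaches the exact gradient. The estimator is unbiased but can have high variance, which is one motivation for the extensions the abstract alludes to.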

categorical distribution, catlog-derivative trick, differentiable sampling, (3 more...)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.43)