SupplementaryMaterial

Feb-9-2026, 13:05:24 GMT–Neural Information Processing Systems

R φqφ(z)dz = 0. Thus, the gradient of the log-variance loss becomes equaltothegradientofthe KL divergence. Therefore, for large enough D, the condition from Proposition 3 (see Eq. 19), is fulfilled and the statement follows immediately. This result isexpected to extend to the multivariate cases as well. For all the experiments listed in the main text, we use the VarGrad estimator for the gradients of the logistic regression models. VarGrad achieves considerable variance reduction over the adaptive (RELAX) and non-adaptive (ControlledReinforce)model-agnosticestimators.

artificial intelligence, estimator, machine learning, (13 more...)

Neural Information Processing Systems

Feb-9-2026, 13:05:24 GMT

Conferences PDF

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.54)

Duplicate Docs Excel Report

Title
Supplementary Material A Proof of Paper Results

Similar Docs Excel Report more

Title	Similarity	Source
None found