Non-Asymptotic Performance Guarantees for Neural Estimation of $\mathsf{f}$-Divergences
Sreekumar, Sreejith, Zhang, Zhengxin, Goldfeld, Ziv
Statistical distances (SDs), which quantify the dissimilarity between probability distributions, are central to machine learning and statistics. A modern method for estimating such distances from data relies on parametrizing a variational form by a neural network (NN) and optimizing it. These estimators are abundantly used in practice, but corresponding performance guarantees are partial and call for further exploration. In particular, there seems to be a fundamental tradeoff between the two sources of error involved: approximation and estimation. While the former needs the NN class to be rich and expressive, the latter relies on controlling complexity. This paper explores this tradeoff by means of non-asymptotic error bounds, focusing on three popular choices of SDs -- Kullback-Leibler divergence, chi-squared divergence, and squared Hellinger distance. Our analysis relies on non-asymptotic function approximation theorems and tools from empirical process theory. Numerical results validating the theory are also provided.
Mar-16-2021
- Country:
- Oceania > Australia
- New South Wales > Sydney (0.04)
- North America
- United States
- New York > Tompkins County
- Ithaca (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- California
- San Diego County > San Diego (0.04)
- Los Angeles County > Long Beach (0.04)
- New York > Tompkins County
- Canada
- Quebec > Montreal (0.04)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- Alberta > Census Division No. 15
- Improvement District No. 9 > Banff (0.04)
- United States
- Europe
- Asia
- Oceania > Australia
- Genre:
- Research Report (0.50)
- Technology: