Comparing Comparators in Generalization Bounds
Hellström, Fredrik, Guedj, Benjamin
We derive generic information-theoretic and PAC-Bayesian generalization bounds involving an arbitrary convex comparator function, which measures the discrepancy between the training and population loss. The bounds hold under the assumption that the cumulant-generating function (CGF) of the comparator is upper-bounded by the corresponding CGF within a family of bounding distributions. We show that the tightest possible bound is obtained with the comparator being the convex conjugate of the CGF of the bounding distribution, also known as the Cram\'er function. This conclusion applies more broadly to generalization bounds with a similar structure. This confirms the near-optimality of known bounds for bounded and sub-Gaussian losses and leads to novel bounds under other bounding distributions.
Oct-16-2023
- Country:
- Europe
- Austria > Styria
- Graz (0.04)
- Finland (0.04)
- France (0.04)
- Italy (0.04)
- Spain
- Andalusia > Cádiz Province
- Cadiz (0.04)
- Catalonia > Barcelona Province
- Barcelona (0.04)
- Andalusia > Cádiz Province
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Oxfordshire > Oxford (0.14)
- Austria > Styria
- North America
- Canada
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- Quebec > Montreal (0.04)
- British Columbia > Metro Vancouver Regional District
- United States
- California
- Los Angeles County > Long Beach (0.04)
- San Diego County > San Diego (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Maryland > Baltimore (0.04)
- New Jersey > Mercer County
- Princeton (0.04)
- New York (0.04)
- Wisconsin > Dane County
- Madison (0.04)
- California
- Canada
- Oceania > Australia
- New South Wales > Sydney (0.04)
- Victoria > Melbourne (0.04)
- Europe
- Genre:
- Research Report (0.64)
- Technology: