Empirical Risk Minimization with Relative Entropy Regularization

Perlaza, Samir M., Bisson, Gaetan, Esnaola, Iñaki, Jean-Marie, Alain, Rini, Stefano

Nov-21-2023–arXiv.org Artificial Intelligence

The empirical risk minimization (ERM) problem with relative entropy regularization (ERM-RER) is investigated under the assumption that the reference measure is a {\sigma}-finite measure, and not necessarily a probability measure. Under this assumption, which leads to a generalization of the ERM-RER problem allowing a larger degree of flexibility for incorporating prior knowledge, numerous relevant properties are stated. Among these properties, the solution to this problem, if it exists, is shown to be a unique probability measure, often mutually absolutely continuous with the reference measure. Such a solution exhibits a probably-approximately-correct guarantee for the ERM problem independently of whether the latter possesses a solution. For a fixed dataset, the empirical risk is shown to be a sub-Gaussian random variable when the models are sampled from the solution to the ERM-RER problem. The generalization capabilities of the solution to the ERM-RER problem (the Gibbs algorithm) are studied via the sensitivity of the expected empirical risk to deviations from such a solution towards alternative probability measures. Finally, an interesting connection between sensitivity, generalization error, and lautum information is established

artificial intelligence, machine learning, probability measure, (17 more...)

arXiv.org Artificial Intelligence

Nov-21-2023

arXiv.org PDF

Add feedback

Country:
- Asia (0.92)
- Europe > United Kingdom
  - England > Oxfordshire > Oxford (0.14)
- North America > United States
  - New Jersey (0.14)

Genre:
- Research Report > Promising Solution (0.34)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Statistical Learning (0.67)
  - Representation & Reasoning (0.93)