KL Divergence Estimation with Multi-group Attribution

Gopalan, Parikshit, Narodytska, Nina, Reingold, Omer, Sharan, Vatsal, Wieder, Udi

Feb-28-2022–arXiv.org Machine Learning

Estimating the Kullback-Leibler (KL) divergence between two distributions given samples from them is well-studied in machine learning and information theory. Motivated by considerations of multi-group fairness, we seek KL divergence estimates that accurately reflect the contributions of sub-populations to the overall divergence. We model the sub-populations coming from a rich (possibly infinite) family $\mathcal{C}$ of overlapping subsets of the domain. We propose the notion of multi-group attribution for $\mathcal{C}$, which requires that the estimated divergence conditioned on every sub-population in $\mathcal{C}$ satisfies some natural accuracy and fairness desiderata, such as ensuring that sub-populations where the model predicts significant divergence do diverge significantly in the two distributions. Our main technical contribution is to show that multi-group attribution can be derived from the recently introduced notion of multi-calibration for importance weights [HKRR18, GRSW21]. We provide experimental evidence to support our theoretical results, and show that multi-group attribution provides better KL divergence estimates when conditioned on sub-populations than other popular algorithms.

algorithm, divergence, multi-group attribution, (16 more...)

arXiv.org Machine Learning

Feb-28-2022

arXiv.org PDF

Add feedback

Country:
- North America
  - United States
    - Michigan (0.04)
    - California > Santa Clara County
      - Palo Alto (0.04)
  - Canada > British Columbia
    - Vancouver Island > Capital Regional District > Victoria (0.04)
- Europe > United Kingdom
  - England > Cambridgeshire > Cambridge (0.04)
- Asia > Middle East
  - Jordan (0.04)

Genre:
- Research Report > New Finding (0.68)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Statistical Learning (0.68)
  - Decision Tree Learning (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found