Subset Selection Based On Multiple Rankings in the Presence of Bias: Effectiveness of Fairness Constraints for Multiwinner Voting Score Functions
Boehmer, Niclas, Celis, L. Elisa, Huang, Lingxiao, Mehrotra, Anay, Vishnoi, Nisheeth K.
–arXiv.org Artificial Intelligence
We consider the problem of subset selection where one is given multiple rankings of items and the goal is to select the highest ``quality'' subset. Score functions from the multiwinner voting literature have been used to aggregate rankings into quality scores for subsets. We study this setting of subset selection problems when, in addition, rankings may contain systemic or unconscious biases toward a group of items. For a general model of input rankings and biases, we show that requiring the selected subset to satisfy group fairness constraints can improve the quality of the selection with respect to unbiased rankings. Importantly, we show that for fairness constraints to be effective, different multiwinner score functions may require a drastically different number of rankings: While for some functions, fairness constraints need an exponential number of rankings to recover a close-to-optimal solution, for others, this dependency is only polynomial. This result relies on a novel notion of ``smoothness'' of submodular functions in this setting that quantifies how well a function can ``correctly'' assess the quality of items in the presence of bias. The results in this paper can be used to guide the choice of multiwinner score functions for the subset selection setting considered here; we additionally provide a tool to empirically enable this.
arXiv.org Artificial Intelligence
Jun-16-2023
- Country:
- Oceania
- New Zealand > North Island
- Auckland Region > Auckland (0.04)
- Australia > Victoria
- Melbourne (0.04)
- New Zealand > North Island
- North America
- United States
- Maryland > Baltimore (0.04)
- Texas > Travis County
- Austin (0.04)
- Virginia > Arlington County
- Arlington (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- Washington > King County
- Seattle (0.04)
- California
- San Francisco County > San Francisco (0.27)
- Los Angeles County > Long Beach (0.04)
- Colorado > Boulder County
- Boulder (0.04)
- New York > New York County
- New York City (0.04)
- Canada
- United States
- Europe
- Hungary (0.04)
- Sweden > Stockholm
- Stockholm (0.04)
- Spain
- Valencian Community > Valencia Province
- Valencia (0.04)
- Catalonia > Barcelona Province
- Barcelona (0.04)
- Valencian Community > Valencia Province
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Italy > Sicily
- Palermo (0.04)
- Germany > North Rhine-Westphalia
- Düsseldorf Region > Düsseldorf (0.04)
- Middle East > Republic of Türkiye
- Istanbul Province > Istanbul (0.04)
- France > Île-de-France
- Netherlands > North Brabant
- Eindhoven (0.04)
- Asia
- Macao (0.04)
- South Korea > Seoul
- Seoul (0.04)
- Middle East > Republic of Türkiye
- Istanbul Province > Istanbul (0.04)
- China
- Beijing > Beijing (0.04)
- Jiangsu Province > Nanjing (0.04)
- Oceania
- Genre:
- Research Report (0.82)
- Industry:
- Health & Medicine (1.00)
- Education (1.00)
- Technology: