On the Discrimination Risk of Mean Aggregation Feature Imputation in Graphs

Jan-18-2025, 23:57:49 GMT–Neural Information Processing Systems

In human networks, nodes belonging to a marginalized group often have a disproportionate rate of unknown or missing features. This, in conjunction with graph structure and known feature biases, can cause graph feature imputation algorithms to predict values for unknown features that make the marginalized group's feature values more distinct from the dominant group's feature values than they are in reality. We call this distinction the discrimination risk. We prove that a higher discrimination risk can amplify the unfairness of a machine learning model applied to the imputed data. We then formalize a general graph feature imputation framework called mean aggregation imputation, and we theoretically and empirically characterize graphs in which applying this framework can yield feature values with a high discrimination risk.
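To make the mechanism concrete, here is a minimal sketch of one round of neighbor averaging on a toy graph. It is an illustration under stated assumptions, not the paper's formal framework: the function names `mean_aggregation_impute` and `group_gap` are hypothetical, and the absolute gap between group means is only a simple stand-in for the paper's discrimination risk, whose exact definition is not given in the abstract.

```python
import numpy as np


def mean_aggregation_impute(adj, x, known):
    """One round of neighbor averaging (illustrative, not the paper's exact method).

    adj   : (n, n) 0/1 adjacency matrix
    x     : (n,) feature values; entries where known is False are ignored
    known : (n,) boolean mask of nodes whose feature is observed
    """
    adj = np.asarray(adj, dtype=float)
    x = np.asarray(x, dtype=float)
    known = np.asarray(known, dtype=bool)

    # Sum and count of each node's neighbors that have an observed feature.
    neighbor_sum = adj @ np.where(known, x, 0.0)
    neighbor_cnt = adj @ known.astype(float)

    imputed = x.copy()
    fill = ~known & (neighbor_cnt > 0)
    imputed[fill] = neighbor_sum[fill] / neighbor_cnt[fill]

    # Assumption: nodes with no observed neighbors fall back to the global mean.
    isolated = ~known & (neighbor_cnt == 0)
    imputed[isolated] = x[known].mean()
    return imputed


def group_gap(x, group):
    """Absolute difference in mean feature value between the two groups."""
    group = np.asarray(group, dtype=bool)
    return abs(x[group].mean() - x[~group].mean())


# Toy graph: two disconnected triangles (perfect homophily).
# Nodes 0-2 are the dominant group, nodes 3-5 the marginalized group.
adj = np.array([
    [0, 1, 1, 0, 0, 0],
    [1, 0, 1, 0, 0, 0],
    [1, 1, 0, 0, 0, 0],
    [0, 0, 0, 0, 1, 1],
    [0, 0, 0, 1, 0, 1],
    [0, 0, 0, 1, 1, 0],
])
group = np.array([False, False, False, True, True, True])
x_true = np.array([0.0, 0.0, 0.0, 1.0, 0.2, 0.2])

# Only the most extreme marginalized node has an observed feature.
known = np.array([True, True, True, True, False, False])
x_obs = np.where(known, x_true, np.nan)

x_imputed = mean_aggregation_impute(adj, x_obs, known)
print("true gap:   ", group_gap(x_true, group))
print("imputed gap:", group_gap(x_imputed, group))
```

In this toy case the two unobserved marginalized nodes inherit the value of their single observed neighbor, so the imputed group means sit further apart than the true ones. The example combines the three ingredients the abstract names: a higher missingness rate in one group, homophilous structure, and a biased set of observed features.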

discrimination risk, feature value, mean aggregation feature imputation

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.83)