FairGBM: Gradient Boosting with Fairness Constraints

Cruz, André F, Belém, Catarina, Jesus, Sérgio, Bravo, João, Saleiro, Pedro, Bizarro, Pedro

Mar-3-2023–arXiv.org Artificial Intelligence

Tabular data is prevalent in many high-stakes domains, such as financial services or public policy. Gradient Boosted Decision Trees (GBDT) are popular in these settings due to their scalability, performance, and low training cost. While fairness in these domains is a foremost concern, existing in-processing Fair ML methods are either incompatible with GBDT, or incur in significant performance losses while taking considerably longer to train. We present FairGBM, a dual ascent learning framework for training GBDT under fairness constraints, with little to no impact on predictive performance when compared to unconstrained GBDT. Since observational fairness metrics are non-differentiable, we propose smooth convex error rate proxies for common fairness criteria, enabling gradient-based optimization using a ``proxy-Lagrangian'' formulation. Our implementation shows an order of magnitude speedup in training time relative to related work, a pivotal aspect to foster the widespread adoption of FairGBM by real-world practitioners.

artificial intelligence, machine learning, optimization problem, (19 more...)

arXiv.org Artificial Intelligence

Mar-3-2023

arXiv.org PDF

Add feedback

Country:
- North America
  - United States
    - New York > New York County
      - New York City (0.04)
    - Georgia > Fulton County
      - Atlanta (0.04)
  - Puerto Rico > San Juan
    - San Juan (0.04)
- Europe
  - Portugal (0.04)
  - France (0.04)
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
  - Germany > Baden-Württemberg
    - Tübingen Region > Tübingen (0.14)
- Asia > China
  - Hong Kong (0.04)

Genre:
- Research Report (1.00)

Industry:
- Banking & Finance (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Optimization (0.93)
  - Machine Learning
    - Statistical Learning (1.00)
    - Performance Analysis > Accuracy (1.00)
    - Ensemble Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found