I-trustworthy Models. A framework for trustworthiness evaluation of probabilistic classifiers

Jan-26-2025–arXiv.org Machine Learning

As probabilistic models continue to permeate various facets of our society and contribute to scientific advancements, it becomes necessary to go beyond traditional metrics such as predictive accuracy and error rates and assess their trustworthiness. Grounded in the competence-based theory of trust, this work formalizes the I-trustworthy framework, a novel framework for assessing the trustworthiness of probabilistic classifiers for inference tasks by linking local calibration to trustworthiness. To assess I-trustworthiness, we use the local calibration error (LCE) and develop a hypothesis-testing method. This method uses a kernel-based test statistic, the Kernel Local Calibration Error (KLCE), to test the local calibration of a probabilistic classifier. This study provides theoretical guarantees by offering convergence bounds for an unbiased estimator of KLCE. Additionally, we present a diagnostic tool designed to identify and measure biases in cases of miscalibration. The effectiveness of the proposed test statistic is demonstrated through its application to both simulated and real-world datasets. Finally, the LCE of related recalibration methods is studied, and we provide evidence that existing methods are insufficient to achieve I-trustworthiness.
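The abstract does not give the definitions of LCE or KLCE, so the following is not the paper's estimator or test. It is only a minimal sketch, on hypothetical synthetic data, of why local calibration can matter: a classifier that looks calibrated on average may still be miscalibrated in specific regions of the input space. The function name `local_calibration_gap`, the Gaussian kernel, and the bandwidth value are all assumptions made for illustration.

```python
import math
import random


def local_calibration_gap(xs, ys, ps, x0, bandwidth=0.2):
    """Gaussian-kernel-weighted mean of (label - predicted probability) near x0.

    Illustrative quantity only (an assumption, not the paper's LCE or KLCE).
    """
    num = 0.0
    den = 0.0
    for x, y, p in zip(xs, ys, ps):
        w = math.exp(-0.5 * ((x - x0) / bandwidth) ** 2)
        num += w * (y - p)
        den += w
    return num / den


# Hypothetical synthetic data: the true P(y=1 | x) is 0.8 for x > 0 and 0.2 otherwise.
rng = random.Random(0)
n = 20000
xs = [rng.uniform(-1.0, 1.0) for _ in range(n)]
ys = [1.0 if rng.random() < (0.8 if x > 0 else 0.2) else 0.0 for x in xs]

# A classifier that always predicts 0.5: roughly calibrated on average,
# but off by about 0.3 in each half of the input space.
ps = [0.5] * n

global_gap = sum(y - p for y, p in zip(ys, ps)) / n
gap_pos = local_calibration_gap(xs, ys, ps, x0=0.5)
gap_neg = local_calibration_gap(xs, ys, ps, x0=-0.5)

print("global gap:", round(global_gap, 3))
print("local gap near x = 0.5:", round(gap_pos, 3))
print("local gap near x = -0.5:", round(gap_neg, 3))
```

In this toy setting the average gap is close to zero while the two local gaps have opposite signs, which is the kind of region-specific bias that a purely global calibration check would miss.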

calibration, data mining, machine learning

Country:
- North America > United States
  - Florida > Broward County (0.04)
  - Texas > Travis County
    - Austin (0.04)
  - Michigan > Genesee County
    - Flint (0.04)
- Asia
  - Thailand (0.04)
  - Japan > Kyūshū & Okinawa
    - Okinawa Prefecture > Naha (0.04)

Genre:
- Research Report
  - Experimental Study (0.67)
  - New Finding (0.46)

Industry:
- Law (0.94)
- Health & Medicine > Therapeutic Area (0.46)
- Government > Regional Government (0.46)

Technology:
- Information Technology
  - Data Science > Data Mining (0.94)
  - Artificial Intelligence
    - Representation & Reasoning > Uncertainty (0.48)
    - Machine Learning
      - Performance Analysis > Accuracy (0.54)
      - Statistical Learning (0.47)
