Trenton Chang, Lindsay Warrenburg

Mar-20-2025, 09:10:29 GMT–Neural Information Processing Systems

In many settings, machine learning models may be used to inform decisions that impact individuals or entities who interact with the model. Such entities, or agents, may game model decisions by manipulating their inputs to the model to obtain better outcomes and maximize some utility. We consider a multi-agent setting where the goal is to identify the "worst offenders": the agents that game most aggressively. However, identifying such agents is difficult without being able to evaluate their utility functions. Thus, we introduce a framework featuring a gaming deterrence parameter, a scalar that quantifies an agent's (un)willingness to game. We show that this gaming parameter is only partially identifiable. By recasting the problem as a causal effect estimation problem in which different agents represent different "treatments," we prove that a ranking of all agents by their gaming parameters is identifiable. We present empirical results from a synthetic data study that validate the use of causal effect estimation for gaming detection, and we show in a case study of diagnosis coding behavior in the U.S. that our approach highlights features associated with gaming.
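The idea of treating agents as "treatments" and ranking them by an estimated causal effect can be illustrated with a minimal sketch. This is not the paper's estimator: it assumes a linear outcome model, a single observed confounder, and a simple regression-adjustment estimator, and every name in it (`simulate`, `rank_agents_by_effect`, the `gaming` intensities) is hypothetical. A larger adjusted effect is read here as more aggressive gaming, which is an assumption of the sketch.

```python
import numpy as np

def simulate(n_per_agent=2000, seed=0):
    """Synthetic data: each agent has a hypothetical gaming intensity."""
    rng = np.random.default_rng(seed)
    gaming = np.array([0.0, 0.5, 1.0, 2.0])  # assumed per-agent intensities
    agent = np.repeat(np.arange(len(gaming)), n_per_agent)
    # The covariate distribution shifts with the agent, so raw outcome
    # means are confounded and do not reflect gaming intensity.
    x = rng.normal(loc=-0.5 * agent[:, None], scale=1.0, size=(len(agent), 1))
    y = 1.5 * x[:, 0] + gaming[agent] + rng.normal(scale=1.0, size=len(agent))
    return x, agent, y

def rank_agents_by_effect(x, agent, y):
    """Rank agents by covariate-adjusted outcome (regression adjustment)."""
    agents = np.unique(agent)
    # Design matrix: one indicator column per agent, then the covariates.
    indicators = (agent[:, None] == agents[None, :]).astype(float)
    design = np.hstack([indicators, x])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    effects = coef[: len(agents)]  # adjusted per-agent effects
    order = np.argsort(-effects)   # largest effect first
    return agents[order], effects[order]

if __name__ == "__main__":
    x, agent, y = simulate()
    ranked, effects = rank_agents_by_effect(x, agent, y)
    for a, e in zip(ranked, effects):
        print(f"agent {a}: adjusted effect {e:.2f}")
```

In this simulation the unadjusted outcome means would place the non-gaming agent first, because agents that game more also see lower covariate values; adjusting for the covariate recovers the ordering by gaming intensity. Only the ordering is used, which mirrors the abstract's point that a ranking can be identifiable even when the parameter itself is not.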

artificial intelligence, data mining, machine learning

Country:
- North America > United States (1.00)

Genre:
- Research Report > Experimental Study (1.00)

Industry:
- Banking & Finance > Insurance (1.00)
- Government > Regional Government
  - North America Government > United States Government (1.00)
- Health & Medicine
  - Government Relations & Public Policy (1.00)
  - Health Care Providers & Services > Reimbursement (1.00)
  - Therapeutic Area > Psychiatry/Psychology (0.67)

Technology:
- Information Technology
  - Artificial Intelligence
    - Machine Learning > Performance Analysis
      - Accuracy (0.67)
    - Representation & Reasoning > Agents (1.00)
  - Data Science > Data Mining (1.00)
