Review for NeurIPS paper: Decisions, Counterfactual Explanations and Strategic Behavior

Neural Information Processing Systems 

This paper proposes and analyzes a model of strategic behavior under counterfactual explanations. In this model, a decision-maker chooses a policy and a small set of explanations that can be provided to decision subjects who receive unfavorable decisions. In response, decision subjects follow the given explanation to improve their future outcomes. Although choosing the optimal policy and set of explanations is NP-hard, the resulting objective is shown to be submodular, which allows for efficient approximation algorithms. The paper establishes an interesting connection between strategic behavior and explainability.
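
As a rough illustration of why submodularity matters here, the sketch below runs the standard greedy algorithm for maximizing a monotone submodular set function under a cardinality constraint, which is known to achieve a (1 - 1/e) approximation. It is not the paper's algorithm or objective: the names `greedy_select` and `coverage` and the toy `reach` data are hypothetical, and the utility is a simple coverage count assumed only for this example (how many decision subjects can follow at least one of the chosen explanations).

```python
from typing import Callable, FrozenSet, Iterable, Set

def greedy_select(
    candidates: Iterable[str],
    utility: Callable[[FrozenSet[str]], float],
    k: int,
) -> Set[str]:
    """Greedily pick at most k candidates, each time adding the one
    with the largest positive marginal gain in utility."""
    candidates = list(candidates)
    chosen: Set[str] = set()
    for _ in range(k):
        base = utility(frozenset(chosen))
        best, best_gain = None, 0.0
        for c in candidates:
            if c in chosen:
                continue
            gain = utility(frozenset(chosen | {c})) - base
            if gain > best_gain:
                best, best_gain = c, gain
        if best is None:  # no candidate improves the utility any further
            break
        chosen.add(best)
    return chosen

# Hypothetical toy data: which decision subjects (by id) could follow
# each candidate explanation. Not taken from the paper.
reach = {
    "e1": {1, 2, 3},
    "e2": {3, 4},
    "e3": {4, 5, 6, 7},
    "e4": {1, 7},
}

def coverage(selected: FrozenSet[str]) -> float:
    """Assumed utility: number of distinct subjects reached by the selection.
    Coverage functions are monotone and submodular."""
    covered = set()
    for e in selected:
        covered |= reach[e]
    return float(len(covered))

selected = greedy_select(reach, coverage, k=2)
```

With this toy data, the greedy procedure first picks `e3` (which reaches four subjects) and then `e1` (which adds three more), so two explanations reach all seven subjects.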