Appendix

Despite initial evidence that explanations might be useful for detecting that a model relies on spurious signals [Lapuschkin et al., 2019, Rieger et al., 2020], a different line of work directly counters this evidence. Zimmermann et al. [2021] showed that feature visualizations [Olah et al., 2017] are no more effective than dataset examples at improving a human's understanding of the features that highly activate a DNN's intermediate neuron. A growing body of evidence suggests that current post hoc explanation approaches may be ineffective for model debugging in practice [Chen et al., 2021, Alqaraawi et al., 2020, Ghassemi et al., 2021, Balagopalan et al., 2022, Poursabzi-Sangdeh et al., 2018, Bolukbasi et al., 2021].

In a promising demonstration, Lapuschkin et al. [2019] apply a clustering procedure to the LRP saliency masks derived from a trained model (a minimal sketch of such a procedure appears below). In that application, the emergent clusters separate groups of inputs on which, presumably, the model relies on different features for its output decision. Our work differs from theirs in a key way: Lapuschkin et al. [2019] seek to understand model behavior, not to perform slice discovery, and there is no reason why a low-performing cluster should emerge from such a clustering procedure.

Schioppa et al. [2022] address the cost of the inverse-Hessian-vector products required by influence functions by forming a low-rank approximation of the Hessian H; they choose the rank D to be around 50 in their experiments. A sketch of this approximation follows the clustering example below.
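The clustering step can be sketched as follows. This is a minimal sketch assuming saliency masks (e.g., from LRP) have already been computed for each input; the function name, the use of scikit-learn's spectral clustering, and n_clusters=5 are illustrative assumptions rather than the exact configuration of Lapuschkin et al. [2019].

```python
import numpy as np
from sklearn.cluster import SpectralClustering

def cluster_saliency_masks(masks: np.ndarray, n_clusters: int = 5) -> np.ndarray:
    """Group inputs by the structure of their saliency masks.

    masks: array of shape (n_inputs, height, width), one mask per input.
    Returns one cluster label per input; inputs sharing a label are,
    presumably, those on which the model relies on similar features.
    """
    # Flatten each mask to a vector and normalize so clustering compares
    # the spatial pattern of relevance rather than its overall magnitude.
    flat = masks.reshape(masks.shape[0], -1)
    flat = flat / (np.linalg.norm(flat, axis=1, keepdims=True) + 1e-12)
    return SpectralClustering(
        n_clusters=n_clusters, affinity="nearest_neighbors"
    ).fit_predict(flat)
```

Inspecting the inputs within each cluster is what may surface reliance on distinct features; nothing in the procedure itself encourages a low-performing cluster to emerge, which is the gap noted above.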
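The low-rank Hessian approximation can likewise be sketched using only Hessian-vector products. Schioppa et al. [2022] use Arnoldi iteration; SciPy's Lanczos-based eigsh is a stand-in assumption here, and hvp is a hypothetical callable computing the product H @ v.

```python
import numpy as np
from scipy.sparse.linalg import LinearOperator, eigsh

def low_rank_hessian(hvp, dim: int, rank: int = 50):
    """Return the top-`rank` eigenpairs (w, V) of the implicit dim x dim
    Hessian, so that H is approximated by V @ diag(w) @ V.T."""
    op = LinearOperator((dim, dim), matvec=hvp, dtype=np.float64)
    # The iterative solver touches H only through matrix-vector products,
    # so the full Hessian is never materialized.
    w, V = eigsh(op, k=rank, which="LM")  # `rank` largest-magnitude eigenpairs
    return w, V

def inverse_hvp(w, V, g):
    """Inverse-Hessian-vector product under the rank-D approximation:
    H^{-1} g ~= V @ diag(1/w) @ V.T @ g."""
    return V @ ((V.T @ g) / w)
```

With D around 50, the Hessian is replaced by D stored eigenpairs, so each inverse-Hessian-vector product needed for influence scores reduces to a few small matrix multiplications.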
