Removing Spurious Concepts from Neural Network Representations via Joint Subspace Estimation

Holstege, Floris, Wouters, Bram, van Giersbergen, Noud, Diks, Cees

Oct-18-2023–arXiv.org Machine Learning

This crucially differs from existing methods, which only focus on the spurious concept features, risking the loss of vital main-task information. Furthermore, we make the identification of the subspaces systematic by introducing statistical tests that attribute directions in the embedding space to either the main-task or the spurious concept. The method, which we call Joint Subspace Estimation (JSE), is shown to be robust against the strength of the spurious correlation and to outperform existing concept-removal methods for a Toy dataset as well as benchmark datasets for image recognition (Waterbirds, CelebA) and natural language processing (MultiNLI). A high-level overview of the method is given in Figure 1. Figure 1: High-level overview of Joint Subspace Estimation (JSE) for concept removal: the input x is fed through a neural network f(x), from which we can extract the vector representation z. Within the vector representation, two orthogonal subspaces are identified: one related to the spurious concept (the background), and one to the main-task concept (bird type). JSE estimates the subspaces of the two concepts simultaneously to prevent mixing of spurious and main-task features.

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Machine Learning

Oct-18-2023

arXiv.org PDF

Add feedback

Country:
- Africa > Rwanda
  - Kigali > Kigali (0.04)
- Asia > Middle East
  - UAE > Abu Dhabi Emirate > Abu Dhabi (0.04)
- Europe
  - Austria (0.04)
  - Belgium > Brussels-Capital Region
    - Brussels (0.04)
  - France > Hauts-de-France
    - Nord > Lille (0.04)
  - Italy > Tuscany
    - Florence (0.04)
  - Netherlands > North Holland
    - Amsterdam (0.04)
- North America
  - Canada > Ontario
    - Toronto (0.04)
  - Dominican Republic (0.04)
  - United States
    - California > San Diego County
      - San Diego (0.04)
    - Louisiana > Orleans Parish
      - New Orleans (0.04)
    - New York > New York County
      - New York City (0.04)

Genre:
- Research Report
  - Experimental Study (0.46)
  - New Finding (0.68)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning
    - Neural Networks > Deep Learning (0.46)
    - Statistical Learning (1.00)
  - Natural Language (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found