Causal Information Splitting: Engineering Proxy Features for Robustness to Distribution Shifts

Mazaheri, Bijan, Mastakouri, Atalanti, Janzing, Dominik, Hardt, Michaela

Jul-31-2023–arXiv.org Artificial Intelligence

Statistical prediction models are often trained on data from different probability distributions than their eventual use cases. One approach to proactively prepare for these shifts harnesses the intuition that causal mechanisms should remain invariant between environments. Here we focus on a challenging setting in which the causal and anticausal variables of the target are unobserved. Leaning on information theory, we develop feature selection and engineering techniques for the observed downstream variables that act as proxies. We identify proxies that help to build stable models and moreover utilize auxiliary training tasks to answer counterfactual questions that extract stability-enhancing information from proxies. We demonstrate the effectiveness of our techniques on synthetic and real data.

artificial intelligence, information, machine learning, (17 more...)

arXiv.org Artificial Intelligence

Jul-31-2023

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - California > Los Angeles County > Pasadena (0.04)
- Europe
  - France (0.04)
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
  - Germany > Baden-Württemberg
    - Tübingen Region > Tübingen (0.04)

Genre:
- Research Report (0.82)

Industry:
- Health & Medicine (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Uncertainty (0.68)
  - Machine Learning > Neural Networks (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found