Learning under random distributional shifts

Bansak, Kirk, Paulson, Elisabeth, Rothenhäusler, Dominik

Oct-30-2023–arXiv.org Machine Learning

In various real-world settings, however, we might expect shifts to arise through the superposition of many small and random changes in the population and environment. Thus, we consider a class of random distribution shift models that capture arbitrary changes in the underlying covariate space, and dense, random shocks to the relationship between the covariates and the outcomes. In this setting, we characterize the benefits and drawbacks of several alternative prediction strategies: the standard approach that directly predicts the long-term outcomes of interest, the proxy approach that directly predicts a shorter-term proxy outcome, and a hybrid approach that utilizes both the long-term policy outcome and (shorter-term) proxy outcome(s). We show that the hybrid approach is robust to the strength of the distribution shift and the proxy relationship. We apply this method to datasets in two high-impact domains: asylum-seeker resettlement and early childhood education. In both settings, we find that the proposed approach results in substantially lower mean-squared error than current approaches.

artificial intelligence, distribution shift, machine learning, (16 more...)

arXiv.org Machine Learning

Oct-30-2023

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Tennessee (0.04)
  - California
    - Santa Clara County > Palo Alto (0.04)
    - Alameda County > Berkeley (0.04)
- Europe
  - Netherlands (0.05)
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
  - Switzerland > Zürich
    - Zürich (0.04)
- Asia > Middle East
  - Republic of Türkiye (0.04)

Genre:
- Research Report > New Finding (0.92)

Industry:
- Law (0.88)
- Government
  - Regional Government (1.00)
  - Immigration & Customs (1.00)
- Education
  - Educational Setting > K-12 Education (0.47)
  - Assessment & Standards > Student Performance (0.46)

Technology:
- Information Technology
  - Data Science (0.67)
  - Artificial Intelligence
    - Representation & Reasoning (1.00)
    - Machine Learning > Statistical Learning (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found