Deep Proxy Causal Learning and its Application to Confounded Bandit Policy Evaluation

Feb-10-2025, 23:26:30 GMT–Neural Information Processing Systems

Proxy causal learning (PCL) is a method for estimating the causal effect of treatments on outcomes in the presence of unobserved confounding, using proxies (structured side information) for the confounder. This is achieved via two-stage regression: in the first stage, we model relations among the treatment and proxies; in the second stage, we use this model to learn the effect of treatment on the outcome, given the context provided by the proxies. We propose a novel method for PCL, the deep feature proxy variable method (DFPV), to address the case where the proxies, treatments, and outcomes are high-dimensional and have nonlinear complex relationships, as represented by deep neural network features. We show that DFPV outperforms recent state-of-the-art PCL methods on challenging synthetic benchmarks, including settings involving high dimensional image data. Furthermore, we show that PCL can be applied to off-policy evaluation for the confounded bandit problem, in which DFPV also exhibits competitive performance.

application, confounded bandit policy evaluation, deep proxy causal learning, (2 more...)

Neural Information Processing Systems

Feb-10-2025, 23:26:30 GMT

Conferences Web Page

Add feedback

Country:
- Asia > Japan > Honshū > Kantō > Kanagawa Prefecture (0.09)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.64)