AITopics

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.05)
North America > Canada > Quebec > Montreal (0.04)

Industry: Health & Medicine (0.48)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Andrew Bennett, Nathan Kallus

Policy Evaluation with Latent Confounders via Optimal Balance

Neural Information Processing SystemsFeb-12-2026, 16:52:23 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, kallus, neural information processing system, (12 more...)

Country: North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.69)

Neural Information Processing SystemsFeb-11-2026, 06:07:23 GMT

Confounding-Robust Policy Evaluation in Infinite-Horizon Reinforcement Learning

Lemma 2.Suppose Assumptions 1 and 2 hold.

machine learning, neural information processing system, reinforcement learning, (15 more...)

Country:

North America > United States > New Jersey > Mercer County > Princeton (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.40)

Andrew Bennett, Nathan Kallus

Policy Evaluation with Latent Confounders via Optimal Balance

Neural Information Processing SystemsOct-3-2025, 01:46:18 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, evaluation, machine learning, (17 more...)

Genre: Research Report > New Finding (0.46)

Industry: Health & Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)

Geetika, null, Tyagi, Somya, Chatterjee, Bapi

Federated Instrumental Variable Analysis via Federated Generalized Method of Moments

arXiv.org Machine LearningMay-28-2025

Instrumental variables (IV) analysis is an important applied tool for areas such as healthcare and consumer economics. For IV analysis in high-dimensional settings, the Generalized Method of Moments (GMM) using deep neural networks offers an efficient approach. With non-i.i.d. data sourced from scattered decentralized clients, federated learning is a popular paradigm for training the models while promising data privacy. However, to our knowledge, no federated algorithm for either GMM or IV analysis exists to date. In this work, we introduce federated instrumental variables analysis (FedIV) via federated generalized method of moments (FedGMM). We formulate FedGMM as a federated zero-sum game defined by a federated non-convex non-concave minimax optimization problem, which is solved using federated gradient descent ascent (FedGDA) algorithm. One key challenge arises in theoretically characterizing the federated local optimality. To address this, we present properties and existence results of clients' local equilibria via FedGDA limit points. Thereby, we show that the federated solution consistently estimates the local moment conditions of every participating client. The proposed algorithm is backed by extensive experiments to demonstrate the efficacy of our approach.

artificial intelligence, bayesian inference, machine learning, (18 more...)

2505.21012

Country:

Asia > Middle East > Jordan (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(2 more...)

Genre:

Research Report > Experimental Study (0.67)
Personal > Honors (0.46)
Research Report > Strength High (0.46)

Industry:

Health & Medicine > Therapeutic Area (0.93)
Information Technology > Security & Privacy (0.86)

Clivio, Oscar, Feller, Avi, Holmes, Chris

Towards Representation Learning for Weighting Problems in Design-Based Causal Inference

arXiv.org Machine LearningSep-24-2024

Reweighting a distribution to minimize a distance to a target distribution is a powerful and flexible strategy for estimating a wide range of causal effects, but can be challenging in practice because optimal weights typically depend on knowledge of the underlying data generating process. In this paper, we focus on design-based weights, which do not incorporate outcome information; prominent examples include prospective cohort studies, survey weighting, and the weighting portion of augmented weighting estimators. In such applications, we explore the central role of representation learning in finding desirable weights in practice. Unlike the common approach of assuming a well-specified representation, we highlight the error due to the choice of a representation and outline a general framework for finding suitable representations that minimize this error. Building on recent work that combines balancing weights and neural networks, we propose an end-to-end estimation procedure that learns a flexible representation, while retaining promising theoretical properties. We show that this approach is competitive in a range of common causal inference tasks.

estimation, outcome model, representation, (16 more...)

2409.16407

Country:

North America > United States > California > Alameda County > Berkeley (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Europe > United Kingdom > Scotland > City of Edinburgh > Edinburgh (0.04)
Africa > Uganda (0.04)

Genre:

Research Report > Experimental Study (1.00)
Research Report > Strength High (0.93)

Industry: Health & Medicine (0.93)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Kallus, Nathan, Oprescu, Miruna

Robust and Agnostic Learning of Conditional Distributional Treatment Effects

arXiv.org Artificial IntelligenceFeb-24-2023

The conditional average treatment effect (CATE) is the best measure of individual causal effects given baseline covariates. However, the CATE only captures the (conditional) average, and can overlook risks and tail events, which are important to treatment choice. In aggregate analyses, this is usually addressed by measuring the distributional treatment effect (DTE), such as differences in quantiles or tail expectations between treatment groups. Hypothetically, one can similarly fit conditional quantile regressions in each treatment group and take their difference, but this would not be robust to misspecification or provide agnostic best-in-class predictions. We provide a new robust and model-agnostic methodology for learning the conditional DTE (CDTE) for a class of problems that includes conditional quantile treatment effects, conditional super-quantile treatment effects, and conditional treatment effects on coherent risk measures given by $f$-divergences. Our method is based on constructing a special pseudo-outcome and regressing it on covariates using any regression learner. Our method is model-agnostic in that it can provide the best projection of CDTE onto the regression model class. Our method is robust in that even if we learn these nuisances nonparametrically at very slow rates, we can still learn CDTEs at rates that depend on the class complexity and even conduct inferences on linear projections of CDTEs. We investigate the behavior of our proposal in simulations, as well as in a case study of 401(k) eligibility effects on wealth.

artificial intelligence, machine learning, treatment effect, (14 more...)

arXiv.org Artificial Intelligence

2205.11486

Country:

North America > United States (0.46)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Spain (0.04)

Genre: Research Report > Experimental Study (1.00)

Industry: Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.88)

Elmachtoub, Adam N., Gupta, Vishal, Zhao, Yunfan

Balanced Off-Policy Evaluation for Personalized Pricing

arXiv.org Artificial IntelligenceFeb-24-2023

We consider a personalized pricing problem in which we have data consisting of feature information, historical pricing decisions, and binary realized demand. The goal is to perform off-policy evaluation for a new personalized pricing policy that maps features to prices. Methods based on inverse propensity weighting (including doubly robust methods) for off-policy evaluation may perform poorly when the logging policy has little exploration or is deterministic, which is common in pricing applications. Building on the balanced policy evaluation framework of Kallus (2018), we propose a new approach tailored to pricing applications. The key idea is to compute an estimate that minimizes the worst-case mean squared error or maximizes a worst-case lower bound on policy performance, where in both cases the worst-case is taken with respect to a set of possible revenue functions. We establish theoretical convergence guarantees and empirically demonstrate the advantage of our approach using a real-world pricing dataset.

artificial intelligence, machine learning, target policy, (16 more...)

arXiv.org Artificial Intelligence

2302.12736

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Europe > Spain > Valencian Community > Valencia Province > Valencia (0.04)

Genre: Research Report (0.64)

Industry: Banking & Finance (0.67)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

arXiv.org Machine LearningOct-17-2021

Rejoinder: Learning Optimal Distributionally Robust Individualized Treatment Rules

Mo, Weibin, Qi, Zhengling, Liu, Yufeng

We thank the opportunity offered by editors for this discussion and the discussants for their insightful comments and thoughtful contributions. We also want to congratulate Kallus (2020) for his inspiring work in improving the efficiency of policy learning by retargeting. Motivated from the discussion in Dukes and Vansteelandt (2020), we first point out interesting connections and distinctions between our work and Kallus (2020) in Section 1. In particular, the assumptions and sources of variation for consideration in these two papers lead to different research problems with different scopes and focuses. In Section 2, following the discussions in Li et al. (2020); Liang and Zhao (2020), we also consider the efficient policy evaluation problem when we have some data from the testing distribution available at the training stage. We show that under the assumption that the sample sizes from training and testing are growing in the same order, efficient value function estimates can deliver competitive performance. We further show some connections of these estimates with existing literature. However, when the growth of testing sample size available for training is in a slower order, efficient value function estimates may not perform well anymore. In contrast, the requirement of the testing sample size for DRITR is not as strong as that of efficient policy evaluation using the combined data. Finally, we highlight the general applicability and usefulness of DRITR in Section 3.

covariate change, dritr, efficient estimate, (15 more...)

doi: 10.1080/01621459.2020.1866581

2110.08936

Country:

North America > United States > North Carolina > Orange County > Chapel Hill (0.04)
North America > United States > District of Columbia > Washington (0.04)

Genre: Research Report > Experimental Study (1.00)

Industry: Health & Medicine (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

arXiv.org Machine LearningDec-5-2020

Rejoinder: New Objectives for Policy Learning

Kallus, Nathan

I would like thank the discussants, Oliver Dukes and Stijn Vansteelandt (DV), Sijia Li, Xiudi Li, Alex Luedtkeand (LLL), and Muxuan Liang and Yingqi Zhao (LZ), for a very thoughtful discussion both of my contribution (Kallus 2020) and of Mo et al. (2020). I similarly thank the editors for putting together this exciting special issue and for curating a timely discussion on new objectives for policy learning. I found the juxtaposition between the two papers particularly apt: while my paper tries to induce an optimal covariate shift based on the premise of invariance, Mo et al. (2020) try to be robust to an undesirable covariate shift for fear of variations. While one optimistically alters the training population, the other pessimistically considers the worst-possible testing population. In the following I review some discussant comments that stood out to me as particularly keenly perceptive and offer some reflections.

curvature, lll, objective, (13 more...)

2012.0313

Genre: Research Report (0.50)

Industry: Health & Medicine (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)