Asymptotically Best Causal Effect Identification with Multi-Armed Bandits

Mar-21-2025, 08:48:03 GMT–Neural Information Processing Systems

This paper considers the problem of selecting a formula for identifying a causal quantity of interest among a set of available formulas. We assume an sequential setting in which the investigator may alter the data collection mechanism in a data-dependent way with the aim of identifying the formula with lowest asymptotic variance in as few samples as possible. We formalize this setting by using the bestarm-identification bandit framework where the standard goal of learning the arm with the lowest loss is replaced with the goal of learning the arm that will produce the best estimate. We introduce new tools for constructing finite-sample confidence bounds on estimates of the asymptotic variance that account for the estimation of potentially complex nuisance functions, and adapt the best-arm-identification algorithms of LUCB and Successive Elimination to use these bounds. We validate our method by providing upper bounds on the sample complexity and an empirical study on artificially generated data.

artificial intelligence, data mining, machine learning, (19 more...)

Neural Information Processing Systems

Mar-21-2025, 08:48:03 GMT

Conferences PDF

Add feedback

Technology:
- Information Technology
  - Artificial Intelligence > Machine Learning
    - Statistical Learning (0.68)
  - Data Science > Data Mining
    - Big Data (0.84)

Duplicate Docs Excel Report

Title
Asymptotically Best Causal Effect Identification with Multi-Armed Bandits Alan Malek

Similar Docs Excel Report more

Title	Similarity	Source
None found