The Adaptive Doubly Robust Estimator and a Paradox Concerning Logging Policy

Apr-24-2026, 14:56:57 GMT–Neural Information Processing Systems

The doubly robust (DR) estimator, which consists of two nuisance parameters, the conditional mean outcome and the logging policy (the probability of choosing an action), is crucial in causal inference. This paper proposes a DR estimator for dependent samples obtained from adaptive experiments. To obtain an asymptotically normal semiparametric estimator from dependent samples with non-Donsker nuisance estimators, we propose adaptive-fitting as a variant of sample-splitting. We also report an empirical paradox that our proposed DR estimator tends to show better performances compared to other estimators utilizing the true logging policy. While a similar phenomenon is known for estimators with i.i.d.

artificial intelligence, data mining, machine learning, (19 more...)

Neural Information Processing Systems

Apr-24-2026, 14:56:57 GMT

Conferences PDF

Add feedback

Genre:
- Research Report
  - New Finding (1.00)
  - Experimental Study (0.93)

Technology:
- Information Technology
  - Data Science > Data Mining (0.94)
  - Artificial Intelligence
    - Representation & Reasoning (1.00)
    - Machine Learning > Statistical Learning
      - Regression (0.93)

Duplicate Docs Excel Report

Title
09e7655fc1dc8fa7c9d6c4478313d5e6-Paper.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found