A Reinforcement Learning Approach to Estimating Long-term Treatment Effects
Tang, Ziyang, Duan, Yiheng, Zhang, Stephanie, Li, Lihong
–arXiv.org Artificial Intelligence
A/B tests) are a powerful tool for estimating treatment effects, to inform decisions making in business, healthcare and other applications. In many problems, the treatment has a lasting effect that evolves over time. A limitation with randomized experiments is that they do not easily extend to measure long-term effects, since running long experiments is time-consuming and expensive. In this paper, we take a reinforcement learning (RL) approach that estimates the average reward in a Markov process. Motivated by real-world scenarios where the observed state transition is nonstationary, we develop a new algorithm for a class of nonstationary problems, and demonstrate promising results in two synthetic datasets and one online store dataset.
arXiv.org Artificial Intelligence
Oct-14-2022
- Country:
- Asia > Middle East
- Jordan (0.04)
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- North America
- Canada
- Alberta (0.14)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- United States > New York
- New York County > New York City (0.04)
- Canada
- Asia > Middle East
- Genre:
- Research Report
- Experimental Study (1.00)
- Strength High (1.00)
- Research Report
- Industry:
- Health & Medicine > Therapeutic Area > Endocrinology > Diabetes (0.94)