Generalized Fitted Q-Iteration with Clustered Data
Hu, Liyuan, Wang, Jitao, Wu, Zhenke, Shi, Chengchun
–arXiv.org Artificial Intelligence
This paper focuses on reinforcement learning (RL) with clustered data, which is commonly encountered in healthcare applications. We propose a generalized fitted Q-iteration (FQI) algorithm that incorporates generalized estimating equations into policy learning to handle the intra-cluster correlations. Theoretically, we demonstrate (i) the optimalities of our Q-function and policy estimators when the correlation structure is correctly specified, and (ii) their consistencies when the structure is mis-specified. Empirically, through simulations and analyses of a mobile health dataset, we find the proposed generalized FQI achieves, on average, a half reduction in regret compared to the standard FQI.
arXiv.org Artificial Intelligence
Oct-7-2025
- Country:
- Asia > Middle East
- Jordan (0.04)
- Europe
- France > Hauts-de-France
- Portugal > Porto
- Porto (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Greater London > London (0.04)
- Oxfordshire > Oxford (0.04)
- North America > United States
- Michigan > Washtenaw County
- Ann Arbor (0.14)
- Washington > King County
- Seattle (0.04)
- Michigan > Washtenaw County
- Asia > Middle East
- Genre:
- Research Report (1.00)
- Industry:
- Health & Medicine (1.00)
- Technology: