RoME: A Robust Mixed-Effects Bandit Algorithm for Optimizing Mobile Health Interventions

Neural Information Processing Systems

Mobile health leverages personalized and contextually tailored interventions optimized through bandit and reinforcement learning algorithms. In practice, however, challenges such as participant heterogeneity, nonstationarity, and nonlinear relationships hinder algorithm performance.