Print Close

Did We Personalize? Assessing Personalization by an Online Reinforcement Learning Algorithm Using Resampling

Presented During: New Methods in Causal Inference and Reinforcement Learning for Personalized Decision-Making

Raaz Dwivedi Speaker
UC Berkeley

Wednesday, Aug 7: 10:55 AM - 11:15 AM
Invited Paper Session

Oregon Convention Center

There is a growing interest in using reinforcement learning (RL) to personalize sequences of treatments in digital health to support users in adopting healthier behaviors. Such sequential decision-making problems involve decisions about when to treat and how to treat based on the user's context (e.g., prior activity level, location, etc.). Online RL is a promising data-driven approach for this problem as it learns based on each user's historical responses and uses that knowledge to personalize these decisions. However, to decide whether the RL algorithm should be included in an

optimized'' intervention for real-world deployment, we must assess the data evidence indicating that the RL algorithm is actually personalizing the treatments to its users. Due to the stochasticity in the RL algorithm, one may get a false impression that it is learning in certain states and using this learning to provide specific treatments. We use a working definition of personalization and introduce a resampling-based methodology for investigating whether the personalization exhibited by the RL algorithm is an artifact of the RL algorithm stochasticity, and illustrate it via a mobile health case study.