Nonparametric Bayesian Q-learning for optimization of dynamic treatment regimes

Ashkan Ertefaie Co-Author
University of Rochester
 
Kevin Lynch Co-Author
University of Pennsylvania
 
James McKay Co-Author
University of Pennsylvania
 
Brent Johnson Co-Author
University of Rochester-Medical Center
 
Indrabati Bhattacharya First Author
Florida State University
 
Indrabati Bhattacharya Presenting Author
Florida State University
 
Tuesday, Aug 6: 11:20 AM - 11:35 AM
3029 
Contributed Papers 
Oregon Convention Center 
Existing methods for estimation of dynamic treatment regimes are limited to intention-to-treat analyses--which estimate the effect of randomization to a particular treatment regime without considering the compliance behavior of patients. In this article, we propose a novel nonparametric Bayesian Q-learning approach to construct optimal sequential treatment regimes that adjust for partial compliance. We consider the popular potential compliance framework, where some potential compliances are latent and need to be imputed. The key challenge is learning the joint distribution of the potential compliances, which we accomplish using a Dirichlet process mixture model. Our approach provides two kinds of treatment regimes: (1) conditional regimes that depend on the potential compliance values; and (2) marginal regimes where the potential compliances are marginalized. Extensive simulation studies highlight the usefulness of our method compared to intention-to-treat analyses. We apply our method on the Adaptive Treatment for Alcohol and Cocaine Dependence Study (ENGAGE), where the goal is to construct optimal treatment regimes to engage patients in therapy.

Keywords

Dirichlet process mixture

Endogeneity

Gaussian copula

Intention-to-treat analysis 

Main Sponsor

Biometrics Section