Print Close

Debiased Digital Twins for Generalizing Treatment Effects from Randomized Trials to Routine Care

Presented During: Section on Statistics in Epidemiology - Measurement Error & Calibration

Yuqing Lei Speaker

Huiyuan Wang Co-Author
University of Pennsylvania

Jie Hu Co-Author
University of Pennsylvania

Yong Chen Co-Author
University of Pennsylvania, Perelman School of Medicine

Monday, Aug 3: 2:50 PM - 3:05 PM
3152
Contributed Papers

Thomas M. Menino Convention & Exhibition Center

Randomized trials provide high-quality causal evidence but often enroll selective populations. By contrast, electronic health records capture broader clinical populations but frequently lack the outcomes needed for reliable treatment-effect estimation. This limits the ability to extend high-quality evidence to routine-care settings when the target outcome is unavailable or incompletely measured. Here we propose a debiased digital twin framework for estimating treatment effects in target populations with covariate-only data. The framework learns outcome models in a source dataset, applies them to the target population to generate paired digital twins, and then uses bias reference outcomes (BROs), whose population-level treatment effects are expected to be null, to detect and calibrate residual bias arising from limited generalizability. The calibration step is a wrapper and can be readily adapted across data settings and digital twin models. In a real-world application transporting brain imaging-based evidence from SPRINT-MIND to a Penn Medicine EHR cohort, evaluation through leave-one-out BRO falsification analyses showed that naive digital twin estimates were systematically biased and severely undercovered, whereas BRO-calibrated estimates were well centered and achieved near-nominal coverage, improving from 2.2% to 95.2%. Across 238 white matter lesion outcomes, the debiased estimates preserved the overall protective pattern associated with intensive blood pressure treatment while enabling inference when the primary outcomes were not routinely observed in the target population. These results support BRO-calibrated digital twins as a practical approach for extending treatment-effect evidence from selective studies to broader clinical populations.

Keywords

Causal Inference

Negative control outcome calibration/ Bias Reference Outcomes

Observational studies

Electronic health records

Main Sponsor

Section on Statistics in Epidemiology