The Evolving Landscape of External Controls and Digital Twins: Current State and Future Directions

Demissie Alemayehu Chair
Pfizer
 
Haitao Chu Discussant
Pfizer
 
Kannan Natarajan Organizer
Pfizer
 
Thursday, Aug 7: 8:30 AM - 10:20 AM
0680 
Topic-Contributed Paper Session 
Music City Center 
Room: CC-101D 

Applied

Yes

Main Sponsor

Section on Bayesian Statistical Science

Co Sponsors

Biometrics Section
Biopharmaceutical Section

Presentations

Calibrated Digital Twins: Improving RCT Analysis with Distributionally Shifted RWD

Randomized controlled trials (RCTs) provide internally valid estimates of treatment effects but are often costly and underpowered. In contrast, real-world data (RWD) offer large-scale, passively collected information that can improve efficiency when integrated appropriately. Recent methods have explored using predictive models trained on RWD—so-called digital twins—to generate individualized outcome predictions and augment RCT analysis. However, when the RWD and RCT populations differ, naively applying external models can induce model shift bias and efficiency loss, undermining the validity of causal conclusions.
We propose a new framework that combines RWD-based digital twin modeling with a calibration step using auxiliary outcomes. This calibration adjusts for systematic discrepancies between the trial and real-world populations, enabling valid and efficient treatment effect estimation in hybrid trial designs. Our approach generalizes classical covariate-adjusted regression and prediction-powered inference to settings with distributional shift between data sources. Theoretically, we show that the proposed estimator is consistent and achieves asymptotic variance no larger than that of the unadjusted baseline under a directional alignment condition. Notably, even if this condition fails to hold, the estimator remains asymptotically unbiased. Empirically, we demonstrate substantial gains in statistical efficiency and robustness through simulations and an application to multi-center neuroimaging data, while maintaining the internal validity of the original RCT.
 

Speaker

Huiyuan Wang, University of Pennsylvania

Virtual Control Groups and Bayesian Statistics in Preclinical Drug Safety Assessment

Utilization of data from historical control animals to form virtual control groups (VCGs) is an innovative approach to embody the 3Rs (reduce, refine, and replace use of control animals) principle in research. However, there is no available systematic evaluation of statistical performance using VCGs in preclinical safety assessment. The optimal selection criteria and combination of VCGs and concurrent control group (CCG) also remain unclear. The VICT3R consortium on VCG controls sponsored by the Innovative Health Initiative (IHI) is currently working to fill in these gaps and refine the implementation of VCGs. This study evaluated the statistical ability as measured by sensitivity and specificity to detect test article effects for body weight and clinical pathology endpoints retrospectively in Pfizer's large animal toxicity studies using VCGs. In addition, exploratory analyses using Bayesian statistics (meta-analytic priors and hierarchical models) were conducted for Developmental and Reproductive Toxicology (DART) and Safety Pharmacology endpoints. Our results show that both VCGs and Bayesian statistics can preserve or improve the sensitivity of detecting toxicologically relevant effects. 

Speaker

Dingzhou Li, Pfizer Inc.

PresentationLL

Speaker

Arup Sinha, FDA/CDER

Utilizing External Data in Practice

In this talk we will discuss the assumptions required for external data to improve inferences in clinical trials, and how different methods produce different ranges and magnitudes of benefit and risk. We will illustrate these assumptions through completed clinical trials that have resulted in regulatory approvals. 

Speaker

Kert Viele, Berry Consultants