01/11/2023: 12:00 PM - 12:15 PM MST
Contributed
Existing literature for prediction of time-to-event data has primarily focused on risk factors from a single individual-level dataset. These analyses often suffer from rare event rates, small sample sizes, high dimensionality and low signal-to-noise ratios. Incorporating published prediction models from large-scale studies is expected to improve the performance of prognosis prediction on internal individual-level time-to-event data. However, existing integration approaches typically assume that underlying distributions from the external and internal data sources are similar, which is often invalid. To account for challenges including heterogeneity, data sharing, and privacy constraints, we propose a discrete failure time modeling procedure, which utilizes a discrete hazard-based Kullback-Leibler discriminatory information measuring the discrepancy between the published models and the internal dataset. Simulations show the advantage of the proposed method compared with those solely based on the internal data or published models. We apply the proposed method to improve the prediction performance on a kidney transplant dataset from a local hospital by integrating this small-scale dataset with published survival models obtained from the national transplant registry.
Calibration
Data integration
Kidney transplant
Relative entropy
Survival prediction
Presenting Author
Di Wang, University of Michigan
First Author
Di Wang, University of Michigan
CoAuthor(s)
Wen Ye, University of Michigan
Randall Sung, University of Michigan
Hui Jiang, University of Michigan
Jeremy Taylor, University of Michigan
Lisa Ly, Temple University
Kevin (Zhi) He, University of Michigan