Combining external aggregate information with primary data to improve statistical efficiency

Yu Shen Speaker
UT M.D. Anderson Cancer Center
 
Thursday, Aug 8: 10:55 AM - 11:15 AM
Topic-Contributed Paper Session 
Oregon Convention Center 
In comparative effectiveness research (CER) for rare types of cancer, it is appealing to combine primary cohort data containing detailed tumor profiles together with aggregate information derived from cancer registry databases. Such integration of data may improve statistical efficiency in CER. A major challenge in combining information from different resources, however, is that the aggregate information from the cancer registry databases could be incomparable with the primary cohort data, which are often collected from a single cancer center or a clinical trial. We develop an adaptive estimation procedure, which uses the combined information to determine the degree of information borrowing from the aggregate data of the external resource. The proposed method yields a substantial gain in statistical efficiency over the conventional method using the primary cohort only, and avoids undesirable biases when the given external information is incomparable to the primary cohort. We apply the proposed method to evaluate the long-term effect of trimodality treatment inflammatory breast cancer by tumor subtypes, while combining the IBC patient cohort at MD Anderson and external information.