Print Close

23: Comparative Evaluation of Statistical Learning Methods for Polygenic Prediction in UK Biobank

Presented During: Contributed Poster Presentations: Korean International Statistical Society

Seunghwan Park Co-Author
Soongsil University

Wonil Chung First Author
Soongsil University

Wonil Chung Presenting Author
Soongsil University

Wednesday, Aug 6: 10:30 AM - 12:20 PM
1631
Contributed Posters

Music City Center

Accurate prediction of complex traits and diseases, is crucial for advancing personalized medicine and preventive healthcare. However, guidelines for selecting optimal PRS models under diverse conditions remain limited, often leaving researchers to rely on generalized assumptions rather than tailored methodologies. In this study, we systematically evaluated PRS prediction models using both simulation-based experiments and real-world datasets, including height, BMI, T2D, and glaucoma. We compared various PRS models across key factors: (1) trait heritability, (2) number of SNPs, (3) proportion of causal SNPs, and (4) trait prevalence. Our results highlight key distinctions between infinitesimal models, which assume all SNPs contribute to traits, and non-infinitesimal models, which consider only a subset of SNPs as causal. Specifically, we demonstrate that non-infinitesimal models, such as LDpred and PRScs, outperform infinitesimal models when the proportion of causal SNPs is low-a characteristic common to many phenotypes. Additionally, sample size was a critical determinant of performance, with LDpred excelling in smaller datasets and PRScs outperforming LDpred in larger datasets.

Keywords

Polygenic risk score (PRS)

PRS models

Computational tools

UK Biobank

Main Sponsor

Korean International Statistical Society