한빛사논문
Md. Moksedul Momin 1,2,3,4, Soohyun Lee 5, Naomi R. Wray 6,7, S. Hong Lee 1,2,4
1Australian Centre for Precision Health, University of South Australia, Adelaide, SA 5000, Australia
2UniSA Allied Health and Human Performance, University of South Australia, Adelaide, SA 5000, Australia
3Department of Genetics and Animal Breeding, Faculty of Veterinary Medicine, Chattogram Veterinary and Animal Sciences University (CVASU), Khulshi, Chattogram 4225, Bangladesh
4South Australian Health and Medical Research Institute (SAHMRI), University of South Australia, Adelaide, SA 5000, Australia
5Division of Animal Breeding and Genetics, National Institute of Animal Science (NIAS), Cheonan, South Korea
6Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD, Australia
7Queensland Brain Institute, University of Queensland, Brisbane, QLD, Australia
Corresponding authors: Md. Moksedul Momin, S. Hong Lee
Abstract
The coefficient of determination (R2) is a well-established measure to indicate the predictive ability of polygenic scores (PGSs). However, the sampling variance of R2 is rarely considered so that 95% confidence intervals (CI) are not usually reported. Moreover, when comparisons are made between PGSs based on different discovery samples, the sampling covariance of R2 is required to test the difference between them. Here, we show how to estimate the variance and covariance of R2 values to assess the 95% CI and p value of the R2 difference. We apply this approach to real data calculating PGSs in 28,880 European participants derived from UK Biobank (UKBB) and Biobank Japan (BBJ) GWAS summary statistics for cholesterol and BMI. We quantify the significantly higher predictive ability of UKBB PGSs compared to BBJ PGSs (p value 7.6e-31 for cholesterol and 1.4e-50 for BMI). A joint model of UKBB and BBJ PGSs significantly improves the predictive ability, compared to a model of UKBB PGS only (p value 3.5e-05 for cholesterol and 1.3e-28 for BMI). We also show that the predictive ability of regulatory SNPs is significantly enriched over non-regulatory SNPs for cholesterol (p value 8.9e-26 for UKBB and 3.8e-17 for BBJ). We suggest that the proposed approach (available in R package r2redux) should be used to test the statistical significance of difference between pairs of PGSs, which may help to draw a correct conclusion about the comparative predictive ability of PGSs.
논문정보
관련 링크
연구자 키워드
관련분야 연구자보기
소속기관 논문보기
관련분야 논문보기
해당논문 저자보기