Overestimated prediction using polygenic prediction derived from summary statistics
BMC Genetics (IF 2.9), Pub Date: 2023-09-14, DOI: 10.1186/s12863-023-01151-4
David Keetae Park , Mingshen Chen , Seungsoo Kim , Yoonjung Yoonie Joo , Rebekah K. Loving , Hyoung Seop Kim , Jiook Cha , Shinjae Yoo , Jong Hun Kim

When a polygenic risk score (PRS) is derived from summary statistics, independence between the discovery and test sets cannot be verified. We compared two types of PRS: one derived from raw genetic data (rPRS) and one derived from the summary statistics of the International Genomics of Alzheimer’s Project (IGAP) (sPRS). Two highly heritable traits in UK Biobank, hypertension and height, were used to establish a reference scale for PRS effect sizes. The sPRS without APOE, derived from IGAP, yielded ΔAUC and ΔR2 of 0.051 ± 0.013 and 0.063 ± 0.015 for the Alzheimer’s Disease Sequencing Project (ADSP), and 0.060 and 0.086 for the Accelerating Medicine Partnership - Alzheimer’s Disease (AMP-AD). In UK Biobank, with discovery and test sets of similar size, rPRS performance for hypertension was 0.0036 ± 0.0027 (ΔAUC) and 0.0032 ± 0.0028 (ΔR2); for height, ΔR2 was 0.029 ± 0.0037. Given the high heritability of hypertension and height and the sample size of UK Biobank, the sPRS results from the Alzheimer’s disease (AD) databases are inflated. Independence between discovery and test sets is a well-known basic requirement for PRS studies, yet many PRS studies cannot meet it because samples cannot be compared directly when only summary statistics are available. Thus, for sPRS, potential sample duplication should be carefully considered within the same ethnic group.
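
The inflation the abstract describes can be illustrated with a toy simulation. The sketch below is not the authors' pipeline: all function names, sample sizes, and the allele frequency are hypothetical. It also uses a plain squared correlation rather than the covariate-adjusted ΔR2 reported above. The phenotype is pure noise (zero heritability), so any apparent predictive power is an artifact. Per-SNP effects are estimated on a discovery set, then the resulting score is evaluated once on the discovery samples themselves (full overlap) and once on independent samples.

```python
import random


def simulate_genotypes(n_samples, n_snps, rng, maf=0.3):
    """Draw allele dosages (0, 1, or 2) per sample and SNP (hypothetical MAF)."""
    return [
        [(rng.random() < maf) + (rng.random() < maf) for _ in range(n_snps)]
        for _ in range(n_samples)
    ]


def marginal_effects(genotypes, phenotype):
    """Per-SNP regression slope of phenotype on dosage (toy summary statistics)."""
    n = len(phenotype)
    y_mean = sum(phenotype) / n
    betas = []
    for j in range(len(genotypes[0])):
        x = [row[j] for row in genotypes]
        x_mean = sum(x) / n
        sxx = sum((xi - x_mean) ** 2 for xi in x)
        sxy = sum((xi - x_mean) * (yi - y_mean) for xi, yi in zip(x, phenotype))
        betas.append(sxy / sxx if sxx > 0 else 0.0)
    return betas


def polygenic_score(genotypes, betas):
    """Weighted sum of dosages, one score per sample."""
    return [sum(b * g for b, g in zip(betas, row)) for row in genotypes]


def r_squared(x, y):
    """Squared Pearson correlation between two equal-length sequences."""
    n = len(x)
    x_mean, y_mean = sum(x) / n, sum(y) / n
    sxy = sum((a - x_mean) * (b - y_mean) for a, b in zip(x, y))
    sxx = sum((a - x_mean) ** 2 for a in x)
    syy = sum((b - y_mean) ** 2 for b in y)
    return (sxy * sxy) / (sxx * syy) if sxx > 0 and syy > 0 else 0.0


rng = random.Random(0)
n_samples, n_snps = 300, 100  # illustrative sizes only

# Discovery set: the phenotype is pure noise, so the true effect of every SNP is zero.
disc_geno = simulate_genotypes(n_samples, n_snps, rng)
disc_pheno = [rng.gauss(0.0, 1.0) for _ in range(n_samples)]
betas = marginal_effects(disc_geno, disc_pheno)

# Independent test set drawn the same way, sharing no samples with discovery.
test_geno = simulate_genotypes(n_samples, n_snps, rng)
test_pheno = [rng.gauss(0.0, 1.0) for _ in range(n_samples)]

r2_overlap = r_squared(polygenic_score(disc_geno, betas), disc_pheno)
r2_independent = r_squared(polygenic_score(test_geno, betas), test_pheno)

print(f"R2 with overlapping samples: {r2_overlap:.3f}")
print(f"R2 with independent samples: {r2_independent:.3f}")
```

Because the effect estimates absorb noise specific to the discovery samples, the score correlates with the phenotype in those same samples even though no SNP has a real effect. On independent samples the score should carry essentially no signal. Partial overlap, the situation the abstract warns about for sPRS, would fall between these two extremes.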

Updated: 2023-09-14