Significant Sparse Polygenic Risk Scores across 813 traits in UK Biobank

We present a systematic assessment of polygenic risk score (PRS) prediction across more than 1,500 traits using genetic and phenotype data in the UK Biobank. We report 813 sparse PRS models with significant (p < 2.5 x 10−5) incremental predictive performance when compared against the covariate-on...

Full description

Bibliographic Details
Main Authors: Tanigawa, Yosuke (Author), Qian, Junyang (Author), Venkataraman, Guhan (Author), Justesen, Johanne Marie (Author), Li, Ruilin (Author), Tibshirani, Robert (Author), Hastie, Trevor (Author), Rivas, Manuel A. (Author)
Format: Article
Language:English
Published: Cold Spring Harbor Laboratory, 2022-05-23T15:38:08Z.
Subjects:
Online Access:Get fulltext
Description
Summary:We present a systematic assessment of polygenic risk score (PRS) prediction across more than 1,500 traits using genetic and phenotype data in the UK Biobank. We report 813 sparse PRS models with significant (p < 2.5 x 10−5) incremental predictive performance when compared against the covariate-only model that considers age, sex, types of genotyping arrays, and the principal component loadings of genotypes. We report a significant correlation between the number of genetic variants selected in the sparse PRS model and the incremental predictive performance (Spearman's ⍴ = 0.61, p = 2.2 x 10−59 for quantitative traits, ⍴ = 0.21, p = 9.6 x 10−4 for binary traits). The sparse PRS model trained on European individuals showed limited transferability when evaluated on non-European individuals in the UK Biobank. We provide the PRS model weights on the Global Biobank Engine (https://biobankengine.stanford.edu/prs).