From disease association to risk assessment: an optimistic view from genome-wide association studies on type 1 diabetes.

Genome-wide association studies (GWAS) have been fruitful in identifying disease susceptibility loci for common and complex diseases. A remaining question is whether we can quantify individual disease risk based on genotype data, in order to facilitate personalized prevention and treatment for compl...

Full description

Bibliographic Details
Main Authors: Zhi Wei, Kai Wang, Hui-Qi Qu, Haitao Zhang, Jonathan Bradfield, Cecilia Kim, Edward Frackleton, Cuiping Hou, Joseph T Glessner, Rosetta Chiavacci, Charles Stanley, Dimitri Monos, Struan F A Grant, Constantin Polychronakos, Hakon Hakonarson
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2009-10-01
Series:PLoS Genetics
Online Access:http://europepmc.org/articles/PMC2748686?pdf=render
id doaj-5ac29935a4e84924af06b44d74f45310
record_format Article
spelling doaj-5ac29935a4e84924af06b44d74f453102020-11-24T22:05:32ZengPublic Library of Science (PLoS)PLoS Genetics1553-73901553-74042009-10-01510e100067810.1371/journal.pgen.1000678From disease association to risk assessment: an optimistic view from genome-wide association studies on type 1 diabetes.Zhi WeiKai WangHui-Qi QuHaitao ZhangJonathan BradfieldCecilia KimEdward FrackletonCuiping HouJoseph T GlessnerRosetta ChiavacciCharles StanleyDimitri MonosStruan F A GrantConstantin PolychronakosHakon HakonarsonGenome-wide association studies (GWAS) have been fruitful in identifying disease susceptibility loci for common and complex diseases. A remaining question is whether we can quantify individual disease risk based on genotype data, in order to facilitate personalized prevention and treatment for complex diseases. Previous studies have typically failed to achieve satisfactory performance, primarily due to the use of only a limited number of confirmed susceptibility loci. Here we propose that sophisticated machine-learning approaches with a large ensemble of markers may improve the performance of disease risk assessment. We applied a Support Vector Machine (SVM) algorithm on a GWAS dataset generated on the Affymetrix genotyping platform for type 1 diabetes (T1D) and optimized a risk assessment model with hundreds of markers. We subsequently tested this model on an independent Illumina-genotyped dataset with imputed genotypes (1,008 cases and 1,000 controls), as well as a separate Affymetrix-genotyped dataset (1,529 cases and 1,458 controls), resulting in area under ROC curve (AUC) of approximately 0.84 in both datasets. In contrast, poor performance was achieved when limited to dozens of known susceptibility loci in the SVM model or logistic regression model. Our study suggests that improved disease risk assessment can be achieved by using algorithms that take into account interactions between a large ensemble of markers. We are optimistic that genotype-based disease risk assessment may be feasible for diseases where a notable proportion of the risk has already been captured by SNP arrays.http://europepmc.org/articles/PMC2748686?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Zhi Wei
Kai Wang
Hui-Qi Qu
Haitao Zhang
Jonathan Bradfield
Cecilia Kim
Edward Frackleton
Cuiping Hou
Joseph T Glessner
Rosetta Chiavacci
Charles Stanley
Dimitri Monos
Struan F A Grant
Constantin Polychronakos
Hakon Hakonarson
spellingShingle Zhi Wei
Kai Wang
Hui-Qi Qu
Haitao Zhang
Jonathan Bradfield
Cecilia Kim
Edward Frackleton
Cuiping Hou
Joseph T Glessner
Rosetta Chiavacci
Charles Stanley
Dimitri Monos
Struan F A Grant
Constantin Polychronakos
Hakon Hakonarson
From disease association to risk assessment: an optimistic view from genome-wide association studies on type 1 diabetes.
PLoS Genetics
author_facet Zhi Wei
Kai Wang
Hui-Qi Qu
Haitao Zhang
Jonathan Bradfield
Cecilia Kim
Edward Frackleton
Cuiping Hou
Joseph T Glessner
Rosetta Chiavacci
Charles Stanley
Dimitri Monos
Struan F A Grant
Constantin Polychronakos
Hakon Hakonarson
author_sort Zhi Wei
title From disease association to risk assessment: an optimistic view from genome-wide association studies on type 1 diabetes.
title_short From disease association to risk assessment: an optimistic view from genome-wide association studies on type 1 diabetes.
title_full From disease association to risk assessment: an optimistic view from genome-wide association studies on type 1 diabetes.
title_fullStr From disease association to risk assessment: an optimistic view from genome-wide association studies on type 1 diabetes.
title_full_unstemmed From disease association to risk assessment: an optimistic view from genome-wide association studies on type 1 diabetes.
title_sort from disease association to risk assessment: an optimistic view from genome-wide association studies on type 1 diabetes.
publisher Public Library of Science (PLoS)
series PLoS Genetics
issn 1553-7390
1553-7404
publishDate 2009-10-01
description Genome-wide association studies (GWAS) have been fruitful in identifying disease susceptibility loci for common and complex diseases. A remaining question is whether we can quantify individual disease risk based on genotype data, in order to facilitate personalized prevention and treatment for complex diseases. Previous studies have typically failed to achieve satisfactory performance, primarily due to the use of only a limited number of confirmed susceptibility loci. Here we propose that sophisticated machine-learning approaches with a large ensemble of markers may improve the performance of disease risk assessment. We applied a Support Vector Machine (SVM) algorithm on a GWAS dataset generated on the Affymetrix genotyping platform for type 1 diabetes (T1D) and optimized a risk assessment model with hundreds of markers. We subsequently tested this model on an independent Illumina-genotyped dataset with imputed genotypes (1,008 cases and 1,000 controls), as well as a separate Affymetrix-genotyped dataset (1,529 cases and 1,458 controls), resulting in area under ROC curve (AUC) of approximately 0.84 in both datasets. In contrast, poor performance was achieved when limited to dozens of known susceptibility loci in the SVM model or logistic regression model. Our study suggests that improved disease risk assessment can be achieved by using algorithms that take into account interactions between a large ensemble of markers. We are optimistic that genotype-based disease risk assessment may be feasible for diseases where a notable proportion of the risk has already been captured by SNP arrays.
url http://europepmc.org/articles/PMC2748686?pdf=render
work_keys_str_mv AT zhiwei fromdiseaseassociationtoriskassessmentanoptimisticviewfromgenomewideassociationstudiesontype1diabetes
AT kaiwang fromdiseaseassociationtoriskassessmentanoptimisticviewfromgenomewideassociationstudiesontype1diabetes
AT huiqiqu fromdiseaseassociationtoriskassessmentanoptimisticviewfromgenomewideassociationstudiesontype1diabetes
AT haitaozhang fromdiseaseassociationtoriskassessmentanoptimisticviewfromgenomewideassociationstudiesontype1diabetes
AT jonathanbradfield fromdiseaseassociationtoriskassessmentanoptimisticviewfromgenomewideassociationstudiesontype1diabetes
AT ceciliakim fromdiseaseassociationtoriskassessmentanoptimisticviewfromgenomewideassociationstudiesontype1diabetes
AT edwardfrackleton fromdiseaseassociationtoriskassessmentanoptimisticviewfromgenomewideassociationstudiesontype1diabetes
AT cuipinghou fromdiseaseassociationtoriskassessmentanoptimisticviewfromgenomewideassociationstudiesontype1diabetes
AT josephtglessner fromdiseaseassociationtoriskassessmentanoptimisticviewfromgenomewideassociationstudiesontype1diabetes
AT rosettachiavacci fromdiseaseassociationtoriskassessmentanoptimisticviewfromgenomewideassociationstudiesontype1diabetes
AT charlesstanley fromdiseaseassociationtoriskassessmentanoptimisticviewfromgenomewideassociationstudiesontype1diabetes
AT dimitrimonos fromdiseaseassociationtoriskassessmentanoptimisticviewfromgenomewideassociationstudiesontype1diabetes
AT struanfagrant fromdiseaseassociationtoriskassessmentanoptimisticviewfromgenomewideassociationstudiesontype1diabetes
AT constantinpolychronakos fromdiseaseassociationtoriskassessmentanoptimisticviewfromgenomewideassociationstudiesontype1diabetes
AT hakonhakonarson fromdiseaseassociationtoriskassessmentanoptimisticviewfromgenomewideassociationstudiesontype1diabetes
_version_ 1725825999423143936