GStream: improving SNP and CNV coverage on genome-wide association studies.

We present GStream, a method that combines genome-wide SNP and CNV genotyping in the Illumina microarray platform with unprecedented accuracy. This new method outperforms previous well-established SNP genotyping software. More importantly, the CNV calling algorithm of GStream dramatically improves t...

Full description

Bibliographic Details
Main Authors: Arnald Alonso, Sara Marsal, Raül Tortosa, Oriol Canela-Xandri, Antonio Julià
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2013-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC3700900?pdf=render
id doaj-9dbd18cc94cd433ba46d9d3ce5b6b887
record_format Article
spelling doaj-9dbd18cc94cd433ba46d9d3ce5b6b8872020-11-24T21:12:04ZengPublic Library of Science (PLoS)PLoS ONE1932-62032013-01-0187e6882210.1371/journal.pone.0068822GStream: improving SNP and CNV coverage on genome-wide association studies.Arnald AlonsoSara MarsalRaül TortosaOriol Canela-XandriAntonio JuliàWe present GStream, a method that combines genome-wide SNP and CNV genotyping in the Illumina microarray platform with unprecedented accuracy. This new method outperforms previous well-established SNP genotyping software. More importantly, the CNV calling algorithm of GStream dramatically improves the results obtained by previous state-of-the-art methods and yields an accuracy that is close to that obtained by purely CNV-oriented technologies like Comparative Genomic Hybridization (CGH). We demonstrate the superior performance of GStream using microarray data generated from HapMap samples. Using the reference CNV calls generated by the 1000 Genomes Project (1KGP) and well-known studies on whole genome CNV characterization based either on CGH or genotyping microarray technologies, we show that GStream can increase the number of reliably detected variants up to 25% compared to previously developed methods. Furthermore, the increased genome coverage provided by GStream allows the discovery of CNVs in close linkage disequilibrium with SNPs, previously associated with disease risk in published Genome-Wide Association Studies (GWAS). These results could provide important insights into the biological mechanism underlying the detected disease risk association. With GStream, large-scale GWAS will not only benefit from the combined genotyping of SNPs and CNVs at an unprecedented accuracy, but will also take advantage of the computational efficiency of the method.http://europepmc.org/articles/PMC3700900?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Arnald Alonso
Sara Marsal
Raül Tortosa
Oriol Canela-Xandri
Antonio Julià
spellingShingle Arnald Alonso
Sara Marsal
Raül Tortosa
Oriol Canela-Xandri
Antonio Julià
GStream: improving SNP and CNV coverage on genome-wide association studies.
PLoS ONE
author_facet Arnald Alonso
Sara Marsal
Raül Tortosa
Oriol Canela-Xandri
Antonio Julià
author_sort Arnald Alonso
title GStream: improving SNP and CNV coverage on genome-wide association studies.
title_short GStream: improving SNP and CNV coverage on genome-wide association studies.
title_full GStream: improving SNP and CNV coverage on genome-wide association studies.
title_fullStr GStream: improving SNP and CNV coverage on genome-wide association studies.
title_full_unstemmed GStream: improving SNP and CNV coverage on genome-wide association studies.
title_sort gstream: improving snp and cnv coverage on genome-wide association studies.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2013-01-01
description We present GStream, a method that combines genome-wide SNP and CNV genotyping in the Illumina microarray platform with unprecedented accuracy. This new method outperforms previous well-established SNP genotyping software. More importantly, the CNV calling algorithm of GStream dramatically improves the results obtained by previous state-of-the-art methods and yields an accuracy that is close to that obtained by purely CNV-oriented technologies like Comparative Genomic Hybridization (CGH). We demonstrate the superior performance of GStream using microarray data generated from HapMap samples. Using the reference CNV calls generated by the 1000 Genomes Project (1KGP) and well-known studies on whole genome CNV characterization based either on CGH or genotyping microarray technologies, we show that GStream can increase the number of reliably detected variants up to 25% compared to previously developed methods. Furthermore, the increased genome coverage provided by GStream allows the discovery of CNVs in close linkage disequilibrium with SNPs, previously associated with disease risk in published Genome-Wide Association Studies (GWAS). These results could provide important insights into the biological mechanism underlying the detected disease risk association. With GStream, large-scale GWAS will not only benefit from the combined genotyping of SNPs and CNVs at an unprecedented accuracy, but will also take advantage of the computational efficiency of the method.
url http://europepmc.org/articles/PMC3700900?pdf=render
work_keys_str_mv AT arnaldalonso gstreamimprovingsnpandcnvcoverageongenomewideassociationstudies
AT saramarsal gstreamimprovingsnpandcnvcoverageongenomewideassociationstudies
AT raultortosa gstreamimprovingsnpandcnvcoverageongenomewideassociationstudies
AT oriolcanelaxandri gstreamimprovingsnpandcnvcoverageongenomewideassociationstudies
AT antoniojulia gstreamimprovingsnpandcnvcoverageongenomewideassociationstudies
_version_ 1716751609631866880