Searching Genome-wide Disease Association Through SNP Data
Taking the advantage of the high-throughput Single Nucleotide Polymorphism (SNP) genotyping technology, Genome-Wide Association Studies (GWASs) are regarded holding promise for unravelling complex relationships between genotype and phenotype. GWASs aim to identify genetic variants associated with di...
Main Author: | |
---|---|
Format: | Others |
Published: |
ScholarWorks @ Georgia State University
2015
|
Subjects: | |
Online Access: | http://scholarworks.gsu.edu/cs_diss/101 http://scholarworks.gsu.edu/cgi/viewcontent.cgi?article=1102&context=cs_diss |
id |
ndltd-GEORGIA-oai-scholarworks.gsu.edu-cs_diss-1102 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-GEORGIA-oai-scholarworks.gsu.edu-cs_diss-11022015-07-22T15:43:54Z Searching Genome-wide Disease Association Through SNP Data Guo, Xuan Taking the advantage of the high-throughput Single Nucleotide Polymorphism (SNP) genotyping technology, Genome-Wide Association Studies (GWASs) are regarded holding promise for unravelling complex relationships between genotype and phenotype. GWASs aim to identify genetic variants associated with disease by assaying and analyzing hundreds of thousands of SNPs. Traditional single-locus-based and two-locus-based methods have been standardized and led to many interesting findings. Recently, a substantial number of GWASs indicate that, for most disorders, joint genetic effects (epistatic interaction) across the whole genome are broadly existing in complex traits. At present, identifying high-order epistatic interactions from GWASs is computationally and methodologically challenging. My dissertation research focuses on the problem of searching genome-wide association with considering three frequently encountered scenarios, i.e. one case one control, multi-cases multi-controls, and Linkage Disequilibrium (LD) block structure. For the first scenario, we present a simple and fast method, named DCHE, using dynamic clustering. Also, we design two methods, a Bayesian inference based method and a heuristic method, to detect genome-wide multi-locus epistatic interactions on multiple diseases. For the last scenario, we propose a block-based Bayesian approach to model the LD and conditional disease association simultaneously. Experimental results on both synthetic and real GWAS datasets show that the proposed methods improve the detection accuracy of disease-specific associations and lessen the computational cost compared with current popular methods. 2015-08-11T07:00:00Z text application/pdf http://scholarworks.gsu.edu/cs_diss/101 http://scholarworks.gsu.edu/cgi/viewcontent.cgi?article=1102&context=cs_diss Computer Science Dissertations ScholarWorks @ Georgia State University Algorithm GWAS SNP analysis epistasis clustering Bayesian Theory |
collection |
NDLTD |
format |
Others
|
sources |
NDLTD |
topic |
Algorithm GWAS SNP analysis epistasis clustering Bayesian Theory |
spellingShingle |
Algorithm GWAS SNP analysis epistasis clustering Bayesian Theory Guo, Xuan Searching Genome-wide Disease Association Through SNP Data |
description |
Taking the advantage of the high-throughput Single Nucleotide Polymorphism (SNP) genotyping technology, Genome-Wide Association Studies (GWASs) are regarded holding promise for unravelling complex relationships between genotype and phenotype. GWASs aim to identify genetic variants associated with disease by assaying and analyzing hundreds of thousands of SNPs. Traditional single-locus-based and two-locus-based methods have been standardized and led to many interesting findings. Recently, a substantial number of GWASs indicate that, for most disorders, joint genetic effects (epistatic interaction) across the whole genome are broadly existing in complex traits. At present, identifying high-order epistatic interactions from GWASs is computationally and methodologically challenging.
My dissertation research focuses on the problem of searching genome-wide association with considering three frequently encountered scenarios, i.e. one case one control, multi-cases multi-controls, and Linkage Disequilibrium (LD) block structure. For the first scenario, we present a simple and fast method, named DCHE, using dynamic clustering. Also, we design two methods, a Bayesian inference based method and a heuristic method, to detect genome-wide multi-locus epistatic interactions on multiple diseases. For the last scenario, we propose a block-based Bayesian approach to model the LD and conditional disease association simultaneously. Experimental results on both synthetic and real GWAS datasets show that the proposed methods improve the detection accuracy of disease-specific associations and lessen the computational cost compared with current popular methods. |
author |
Guo, Xuan |
author_facet |
Guo, Xuan |
author_sort |
Guo, Xuan |
title |
Searching Genome-wide Disease Association Through SNP Data |
title_short |
Searching Genome-wide Disease Association Through SNP Data |
title_full |
Searching Genome-wide Disease Association Through SNP Data |
title_fullStr |
Searching Genome-wide Disease Association Through SNP Data |
title_full_unstemmed |
Searching Genome-wide Disease Association Through SNP Data |
title_sort |
searching genome-wide disease association through snp data |
publisher |
ScholarWorks @ Georgia State University |
publishDate |
2015 |
url |
http://scholarworks.gsu.edu/cs_diss/101 http://scholarworks.gsu.edu/cgi/viewcontent.cgi?article=1102&context=cs_diss |
work_keys_str_mv |
AT guoxuan searchinggenomewidediseaseassociationthroughsnpdata |
_version_ |
1716808182700965888 |