Searching Genome-wide Disease Association Through SNP Data

Taking the advantage of the high-throughput Single Nucleotide Polymorphism (SNP) genotyping technology, Genome-Wide Association Studies (GWASs) are regarded holding promise for unravelling complex relationships between genotype and phenotype. GWASs aim to identify genetic variants associated with di...

Full description

Bibliographic Details
Main Author: Guo, Xuan
Format: Others
Published: ScholarWorks @ Georgia State University 2015
Subjects:
Online Access:http://scholarworks.gsu.edu/cs_diss/101
http://scholarworks.gsu.edu/cgi/viewcontent.cgi?article=1102&context=cs_diss
id ndltd-GEORGIA-oai-scholarworks.gsu.edu-cs_diss-1102
record_format oai_dc
spelling ndltd-GEORGIA-oai-scholarworks.gsu.edu-cs_diss-11022015-07-22T15:43:54Z Searching Genome-wide Disease Association Through SNP Data Guo, Xuan Taking the advantage of the high-throughput Single Nucleotide Polymorphism (SNP) genotyping technology, Genome-Wide Association Studies (GWASs) are regarded holding promise for unravelling complex relationships between genotype and phenotype. GWASs aim to identify genetic variants associated with disease by assaying and analyzing hundreds of thousands of SNPs. Traditional single-locus-based and two-locus-based methods have been standardized and led to many interesting findings. Recently, a substantial number of GWASs indicate that, for most disorders, joint genetic effects (epistatic interaction) across the whole genome are broadly existing in complex traits. At present, identifying high-order epistatic interactions from GWASs is computationally and methodologically challenging. My dissertation research focuses on the problem of searching genome-wide association with considering three frequently encountered scenarios, i.e. one case one control, multi-cases multi-controls, and Linkage Disequilibrium (LD) block structure. For the first scenario, we present a simple and fast method, named DCHE, using dynamic clustering. Also, we design two methods, a Bayesian inference based method and a heuristic method, to detect genome-wide multi-locus epistatic interactions on multiple diseases. For the last scenario, we propose a block-based Bayesian approach to model the LD and conditional disease association simultaneously. Experimental results on both synthetic and real GWAS datasets show that the proposed methods improve the detection accuracy of disease-specific associations and lessen the computational cost compared with current popular methods. 2015-08-11T07:00:00Z text application/pdf http://scholarworks.gsu.edu/cs_diss/101 http://scholarworks.gsu.edu/cgi/viewcontent.cgi?article=1102&context=cs_diss Computer Science Dissertations ScholarWorks @ Georgia State University Algorithm GWAS SNP analysis epistasis clustering Bayesian Theory
collection NDLTD
format Others
sources NDLTD
topic Algorithm
GWAS
SNP analysis
epistasis
clustering
Bayesian Theory
spellingShingle Algorithm
GWAS
SNP analysis
epistasis
clustering
Bayesian Theory
Guo, Xuan
Searching Genome-wide Disease Association Through SNP Data
description Taking the advantage of the high-throughput Single Nucleotide Polymorphism (SNP) genotyping technology, Genome-Wide Association Studies (GWASs) are regarded holding promise for unravelling complex relationships between genotype and phenotype. GWASs aim to identify genetic variants associated with disease by assaying and analyzing hundreds of thousands of SNPs. Traditional single-locus-based and two-locus-based methods have been standardized and led to many interesting findings. Recently, a substantial number of GWASs indicate that, for most disorders, joint genetic effects (epistatic interaction) across the whole genome are broadly existing in complex traits. At present, identifying high-order epistatic interactions from GWASs is computationally and methodologically challenging. My dissertation research focuses on the problem of searching genome-wide association with considering three frequently encountered scenarios, i.e. one case one control, multi-cases multi-controls, and Linkage Disequilibrium (LD) block structure. For the first scenario, we present a simple and fast method, named DCHE, using dynamic clustering. Also, we design two methods, a Bayesian inference based method and a heuristic method, to detect genome-wide multi-locus epistatic interactions on multiple diseases. For the last scenario, we propose a block-based Bayesian approach to model the LD and conditional disease association simultaneously. Experimental results on both synthetic and real GWAS datasets show that the proposed methods improve the detection accuracy of disease-specific associations and lessen the computational cost compared with current popular methods.
author Guo, Xuan
author_facet Guo, Xuan
author_sort Guo, Xuan
title Searching Genome-wide Disease Association Through SNP Data
title_short Searching Genome-wide Disease Association Through SNP Data
title_full Searching Genome-wide Disease Association Through SNP Data
title_fullStr Searching Genome-wide Disease Association Through SNP Data
title_full_unstemmed Searching Genome-wide Disease Association Through SNP Data
title_sort searching genome-wide disease association through snp data
publisher ScholarWorks @ Georgia State University
publishDate 2015
url http://scholarworks.gsu.edu/cs_diss/101
http://scholarworks.gsu.edu/cgi/viewcontent.cgi?article=1102&context=cs_diss
work_keys_str_mv AT guoxuan searchinggenomewidediseaseassociationthroughsnpdata
_version_ 1716808182700965888