Modelling dependencies in genetic-marker data and its application to haplotype analysis
The objective of this thesis is to develop new methods to reconstruct haplotypes from phase-unknown genotypes. The need for new methodologies is motivated by the increasing availability of high-resolution marker data for many species. Such markers typically exhibit Linkage Disequilibrium (LD). It is...
Main Author: | |
---|---|
Published: |
University of Edinburgh
2008
|
Subjects: | |
Online Access: | http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.661639 |
id |
ndltd-bl.uk-oai-ethos.bl.uk-661639 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-bl.uk-oai-ethos.bl.uk-6616392018-06-26T03:12:18ZModelling dependencies in genetic-marker data and its application to haplotype analysisSchouten, Michael T.2008The objective of this thesis is to develop new methods to reconstruct haplotypes from phase-unknown genotypes. The need for new methodologies is motivated by the increasing availability of high-resolution marker data for many species. Such markers typically exhibit Linkage Disequilibrium (LD). It is believed that reconstructed haplotypes for markers in high LD can be valuable for a variety of application areas in population genetics, including reconstructing population history and identifying genetic disease variants. The thesis begins with a critical assessment of the limitations of existing methods, and then presents a unified statistical framework that can accommodate pedigree data, unrelated individuals and tightly linked markers. The framework makes use of graphical models, where inference entails representing the relevant joint probability distribution as a graph and the using associated algorithms to facilitate computation. The graphical model formalism provides invaluable tools to facilitate model specification, visualization, and inference. Once the unified framework is developed, a broad range of simulation studies are conducted using previously published haplotype data. Important contributions include demonstrating the different ways in which the haplotype frequency distribution can impact the accuracy of both the phase assignments and haplotype frequency estimates; evaluating the effectiveness of using family data to improve accuracy for different frequency profiles; and, assessing the dangers of treating related individuals as unrelated in an association study.576.58University of Edinburghhttp://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.661639http://hdl.handle.net/1842/30729Electronic Thesis or Dissertation |
collection |
NDLTD |
sources |
NDLTD |
topic |
576.58 |
spellingShingle |
576.58 Schouten, Michael T. Modelling dependencies in genetic-marker data and its application to haplotype analysis |
description |
The objective of this thesis is to develop new methods to reconstruct haplotypes from phase-unknown genotypes. The need for new methodologies is motivated by the increasing availability of high-resolution marker data for many species. Such markers typically exhibit Linkage Disequilibrium (LD). It is believed that reconstructed haplotypes for markers in high LD can be valuable for a variety of application areas in population genetics, including reconstructing population history and identifying genetic disease variants. The thesis begins with a critical assessment of the limitations of existing methods, and then presents a unified statistical framework that can accommodate pedigree data, unrelated individuals and tightly linked markers. The framework makes use of graphical models, where inference entails representing the relevant joint probability distribution as a graph and the using associated algorithms to facilitate computation. The graphical model formalism provides invaluable tools to facilitate model specification, visualization, and inference. Once the unified framework is developed, a broad range of simulation studies are conducted using previously published haplotype data. Important contributions include demonstrating the different ways in which the haplotype frequency distribution can impact the accuracy of both the phase assignments and haplotype frequency estimates; evaluating the effectiveness of using family data to improve accuracy for different frequency profiles; and, assessing the dangers of treating related individuals as unrelated in an association study. |
author |
Schouten, Michael T. |
author_facet |
Schouten, Michael T. |
author_sort |
Schouten, Michael T. |
title |
Modelling dependencies in genetic-marker data and its application to haplotype analysis |
title_short |
Modelling dependencies in genetic-marker data and its application to haplotype analysis |
title_full |
Modelling dependencies in genetic-marker data and its application to haplotype analysis |
title_fullStr |
Modelling dependencies in genetic-marker data and its application to haplotype analysis |
title_full_unstemmed |
Modelling dependencies in genetic-marker data and its application to haplotype analysis |
title_sort |
modelling dependencies in genetic-marker data and its application to haplotype analysis |
publisher |
University of Edinburgh |
publishDate |
2008 |
url |
http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.661639 |
work_keys_str_mv |
AT schoutenmichaelt modellingdependenciesingeneticmarkerdataanditsapplicationtohaplotypeanalysis |
_version_ |
1718707413617475584 |